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REMARKS 

Claims 21-30 and 32-40 are pending in the application. Claims 32-34 and 38-40 are 
withdrawn as being drawn to non-elected inventions. Claim 31 has been cancelled. Claims 21-31 and 
35-37 are under consideration. Claims 21, 22, 25, 29, 30, and 36 have been amended to further 
clarify the intended subject matter of the claimed invention. Entry of these amendments is respectfully 
requested. Applicants reserve the nght to prosecute non-elected subject matter in subsequent 
divisional applications. 

Information Disclosure Statement 

Although not required for the submission of an information disclosure statement, Applicants 
have enclosed an alignment of the sequence of SEQ ID NO: 12 with the sequences of references 1 and 
2 from the IDS filed October 21, 2002 in order to clarify the relevance of the reference sequences 
(please see CLUSTALW alignment attached as Exhibit A). 

Amendments to the specification 

Priority 

The specification has been amended to add the priority information as suggested by the 
Examiner in order to comply with 35 U.S.C. § 1 19(e) and 37 C.F.R. § 1.78. Applicants previously 
made a proper claim to priority under Article 8 of the Patent Cooperation Treaty (pages 1-2 of the 
Declaration and Power of Attorney filed May 8, 2001). 

Table 2 

Tabic 2 has been amended to clarify that SEQ ID NO; 12 shows homology to sequence 
AAR32188 of the GENESEQ database. In the application as originally filed, the sequence 
homologous to SEQ ID NO: 12 was listed in Table 2 as R32188. This amendment clarifies which 
database and sequence entry were intended. No new matter is added to the specification by this 
amendment. Therefore, entry of this amendment is respectfully requested. 
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Comments regarding restriction requirement 

Applicants affirm the election with traverse of claims 21-31, and 35-37, corresponding to the 
invention of Group A, and the election with traverse of SEQ ID NO: 12 and SEQ ID NO:25. Claims 
32-34, 41, 42, and 38-40 are "method of use" claims which depend from the elected polynucleotide 
and polypeptide product claims. Therefore, upon allowance of the polynucleotide and polypeptide 
product claims, it is believed that claims 32-34, 41, 42, and 38-40 should be rejoined and considered 
in accordance with the Commissioner's Notice in the Official Gazette of March 26, entitled "Guidance 
on Treatment of Product and Process Claims in light of In re Ochiai, In re Brouwer and 35 U.S.C.§ 
103(b)." 

Objections to the claims 

Claims 21, 22, 25, 29, 30, and 36 have been amended to address the objections. These 
claims, as currently pending, refer only to the elected polynucleotides and polypeptides of Group A. 
Withdrawal of the objection is therefore requested. 

Utility Rejections under 35 U.S.C. §101 and §112. First Paragraph 

Claims 21-31 and 35-37 are rejected under 35 U.S.C. §§ 101 and 112, first paragraph, based 
on the allegation that the claimed invention lacks patentable utility. The Office Action alleges in 
particular that "the claimed invention is not supported by either a specific and substantial utility or a well 
established utility' 1 (Office Action, page 5). Applicants respectfully traverse the rejections. 

The rejection of claims 21-31 and 35-37 is improper, as the inventions of those claims 
have a patentable utility as set forth in the instant specification, and/or a utility well known to 
one of ordinary skill in the art. 

The invention at issue comprises polynucleotides expressed in hematopoietic/immune, 
gastrointestinal, cardiovascular, and reproductive tissues (Specification, e.g., at page 6 and Table 3). 
The invention also comprises polypeptides encoded by the claimed polynucleotides. The claimed 
polypeptides are identified in the patent application as human cell surface receptor proteins, abbreviated 
as HCSRP. As such, the claimed invention has numerous practical, beneficial uses in toxicology testing. 
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drug development, and the diagnosis of disease, none of which require knowledge of how the 
polypeptide actually functions. 

The similarity of the claimed polypeptide to another polypeptide of known, undisputed utility by 
itself demonstrates utility beyond the reasonable probability required by law. HCSRP-12 is, in that 
regard, homologous to non-CD4 glycoprotein gpl20 receptor (GENESEQ AAR32188) 
(Specification, e.g., at Table 2). In particular, SEQ ID NO: 12 shares 86% sequence identity with the 
gpl20 receptor. 

This is more than enough homology to demonstrate a reasonable probability that the utility of 
the gpl20 receptor can be imputed to the claimed invention. It is well-known that the probability that 
two unrelated polypeptides share more than 40% sequence homology over 70 amino acid residues is 
exceedingly small. Brenner et aU Proc. Natl. Acad. Sci. U.S.A. 95:6073-78 (1998). Given 
homology in excess of 40% over many more than 70 amino acid residues, the probability that the 
claimed polypeptide is related to the gp!20 receptor is, accordingly, very high. 

The fact that the claimed polypeptide is a member of the C-type lectin receptor family alone 
demonstrates utility. Each of the members of this class, regardless of their particular functions, are 
useful. There is no evidence that any member of this class of polypeptides, let alone a substantial 
number of them, would not have some patentable utility. It follows that there is a more than substantial 
likelihood that the claimed polypeptide also has patentable utility, regardless of its actual function. The 
law has never required a patentee to prove more. 

There is, in addition, direct proof of the utility of the claimed invention. Applicants submit with 
this paper the Declarations of Bedilion and Furness describing some of the practical uses of the claimed 
invention in gene and protein expression monitoring applications as they would have been understood at 
the tunc of the patent application. 

The Bedilion Declaration describes, in particular, how the claimed expressed polynucleotide 
can be used in gene expression monitoring applications that were well-known at the time the patent 
application was filed, and how those applications are useful in developing drugs and monitoring their 
activitv. Dr. Bedilion states that the claimed invention is a useful tool when employed as a highly 
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Persons skilled in the art would appreciate that cDNA microarrays that contained the SEQ ID 
NO: 12-encoding polynucleotides would be a more useful tool than cDNA microarrays that did 
not contain the polynucleotides in connection with conducting gene expression monitoring 
studies on proposed (or actual) drugs for treating cell proliferative disorders, immune system 
disorders, infections, and neuronal disorders for such purposes as evaluating their efficacy and 
toxicity. 

The Patent Examiner does not dispute that the claimed polynucleotide can be used as a probe 
in cDNA microarrays and used in gene expression monitoring applications. Instead, the Patent 
Examiner contends that the claimed polynucleotide cannot be useful without precise knowledge of its 
biological function. But the law never has required knowledge of biological function to prove utility. It 
is the claimed invention's uses, not its functions, that are the subject of a proper analysis under the utility 
requirement. 

In any event, as demonstrated by the Bedilion Declaration, the person of ordinary skill in the art 
can achieve beneficial results from the claimed polynucleotide in the absence of any knowledge as to 
the precise function of the protein encoded by it. The uses of the claimed polynucleotide in gene 
expression monitoring applications are in fact independent of its precise function. 

The Furness Declaration describes, in particular, how the claimed polypeptide can be used in 
protein expression analysis techniques such as 2-D PAGE gels and western blots. Using the claimed 
invention with these techniques, persons of ordinary skill in the art can better assess, for example, the 
potential toxic affect of a drug candidate. (Furness Declaration at % [1 1]). 

The Patent Examiner does not dispute that the claimed polypeptide can be used in 2-D PAGE 
gels and western blots to perform drug toxicity testing. Instead, the Patent Examiner contends that the 
claimed polypeptide cannot be useful without precise knowledge of its function. But the law never has 
required knowledge of biological function to prove utility. It is the claimed invention's uses, not its 
functions, that arc the subject of a proper analysis under the utility requirement. 

In any event, as demonstrated by the Furness Declaration, the person ol ordinary skill in the art 
can achieve beneficial results from the claimed polypeptide in the absence of any knowledge as to the 
precise function of the protein. The uses of the claimed polypeptide for gene expression monitoring 
applications including loxieoloev testine arc in fact independent ol its precise function. 
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I. The Applicable Legal Standard 

To meet the utility requirement of sections 101 and 1 12 of the Patent Act, the patent applicant 

need only show that the claimed invention is "practically useful;' Anderson v. Natta, 480 F.2d 1392, 

1397, 178 USPQ 458 (CCPA 1973) and confers a "specific benefit" on the public. Brenner v. 

Manson, 383 U.S. 519, 534-35, 148 USPQ 689 (1966). As discussed in a recent Court of Appeals 

for the Federal Circuit case, this threshold is not high: 

An invention is "useful" under section 101 if it is capable of providing some identifiable benefit. 
See Brenner v. Manson, 383 U.S. 519, 534 [148 USPQ 689] (1966); Brooktree Corp, v. 
Advanced Miero Devices, Inc., 977 F.2d 1555, 1571 [24 USPQ2d 1401] (Fed. Cir. 1992) 
("to violate Section 101 the claimed device must be totally incapable of achieving a useful 
result"); Fuller v. Berger, 120 F. 274, 275 (7th Cir. 1903) (test for utility is whether invention 
"is incapable of serving any beneficial end"). 

Juicy Whip Inc. v. Orange Bang Inc., 51 USPQ2d 1700 (Fed. Cir. 1999). 

While an asserted utility must be described with specificity, the patent applicant need not 

demonstrate utility to a certainty. In Stiftung v. Renishaw PLC, 945 F.2d 1 173, 1 180, 20 USPQ2d 

1094 (Fed. Cir. 1991), the United States Court of Appeals for the Federal Circuit explained: 

An invention need not be the best or only way to accomplish a certain result, and it need only 
be useful to some extent and in certain applications: "[T]he fact that an invention has only limited 
utility and is only operable in certain applications is not grounds for finding lack of utility." 
Envirotech Corp. v. Al George, Inc., 730 F.2d 753, 762, 221 USPQ 473, 480 (Fed. Cir. 
1984). 

The specificity requirement is not, therefore, an onerous one. If the asserted utility is described 
so that a person of ordinary skill in the art would understand how to use the claimed invention, it is 
sufficiently specific. See Standard Oil Co. v. Montedison, S.p.a., 212 U.S.P.Q. 327, 343 (3d Cir. 
1981 ). The specificity requirement is met unless the asserted utility amounts to a "nebulous expression" 
such as "biological activity" or "biological properties" that does not convey meaningful information 
about the utility of what is being claimed. Cross v. Iizuka, 753 F.2d 1040, 1048 (Fed. Cir. 1985). 

In addition to conferring a specific benefit on the public, the benefit must also be "substantial." 
Brenner. 383 U.S. at 534. A "substantial" utility is a practical, "real-world" utility. Nelson v. Bowler, 
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If persons of ordinary skill in the art would understand that there is a "well-established" utility 
for the claimed invention, the threshold is met automatically and the applicant need not make any 
showing to demonstrate utility. Manual of Patent Examination Procedure at § 706.03(a). Only if there 
is no "well-established" utility for the claimed invention must the applicant demonstrate the practical 
benefits of the invention, Id. 

Once the patent applicant identifies a specific utility, the claimed invention is presumed to 
possess it. In re Cortright, 165 F.3d 1353, 1357, 49 USPQ2d 1464 (Fed. Cir. 1999); In re Brana, 
51 F.3d 1560, 1566; 34 USPQ2d 1436 (Fed. Cir. 1995). In that case, the Patent Office bears the 
burden of demonstrating that a person of ordinary skill in the art would reasonably doubt that the 
asserted utility could be achieved by the claimed invention. Id. To do so, the Patent Office must 
provide evidence or sound scientific reasoning. See In re hanger, 503 F.2d 1380, 1391-92, 183 
USPQ 288 (CCPA 1974). If and only if the Patent Office makes such a showing, the burden shifts to 
the applicant to provide rebuttal evidence that would convince the person of ordinary skill that there is 
sufficient proof of utility. Brana. 51 F.3d at 1566. The applicant need only prove a "substantial 
likelihood" of utility; certainty is not required. Brenner, 383 U.S. at 532. 

II. Uses of the claimed polypeptides and polynucleotides for diagnosis of conditions and 
disorders characterized by expression of HCSRP, for toxicology testing, and for drug 
discovery are sufficient utilities under 35 U.S.C. §§ 101 and 112, first paragraph 

The claimed invention meets all of the necessary requirements for establishing a credible utility 
under the Patent Law: There are "well-established" uses for the claimed invention known to persons of 
ordinary skill in the art, and there are specific practical and beneficial uses for the invention disclosed in 
the patent application's specification. These uses are explained, in detail, in the Bedilion Declaration 
and the Furness Declaration accompanying this paper. Objective evidence, not considered by the 
Patent Office, further corroborates the credibility of the asserted utilities. 
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A. The use of HCSRP for toxicology testing, drug discovery, and disease 
diagnosis are practical uses that confer "specific benefits" to the public 

The claimed invention has specific, substantial, real-world utility by virtue of its use in toxicology 
testing, drug development and disease diagnosis through gene expression profiling. These uses are 
explained in detail in the accompanying Bedilion Declaration and Furness Declaration, the substance of 
which is not rebutted by the Patent Examiner. There is no dispute that the claimed polynucleotide is in 
fact a useful tool in cDNA microarrays used to perform gene expression analysis and that the claimed 
polypeptide is a useful tool in two-dimensional polyacrylamide gel electrophoresis C2-D PAGE") 
analysis and western blots used to monitor protein expression and assess drug toxicity. These uses are 
sufficient to establish utilities for the claimed polynucleotide and polypeptide, respectively. 

The instant application is a divisional of, and claims priority to, United States Provisional Patent 
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application' 1 ). 

1. The Bedilion Declaration 

In his Declaration, Dr. Bedilion explains the many reasons why a person skilled in the art 
reading the Tang et al. '404 application on March 8, 1999 would have understood that application to 
disclose the claimed polynucleotide to be useful for a number of gene expression monitoring 
applications, e.g., as a highly specific probe for the expression of that specific polynucleotide in 
connection with the development of drugs and the monitoring of the activity of such drugs. (Bedilion 
Declaration at, e.g., ffl 10-15). Much, but not all, of Dr. Bedilion's explanation concerns the use of the 
claimed polynucleotide in cDNA microarrays of the type first developed at Stanford University for 
evaluating the efficacy and toxicity of drugs, as well as for other applications. (Bedilion Declaration, f|| 
12 and KM. 1 

In connection with his explanations. Dr. Bedilion states that the k Tang et al. ^404 specification 



! Dr. Bedilion also explained, for example, why persons skilled in the art would also appreciate, 
based on the Tang et al. '404 specification, that the claimed polynucleotide would be useful in 
i'onn<v!ion with JovHonin" new dm^s nsim* technolo^v. such as Northern analvsis. that predated bv 
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would have led a person skilled in the art on March 8, 1999 who was using gene expression monitoring 
in connection with working on developing new drugs for the treatment of cell proliferative disorders, 
immune system disorders, infections, and neuronal disorders [a] to conclude that a cDNA microarray 
that contained the SEQ ID NO: 12-encoding polynucleotides would be a highly useful tool, and [b] to 
request specifically that any cDNA microarray that was being used for such purposes contain the SEQ 
ID NO: 12-encoding polynucleotides" (Bedilion Declaration,! 15 ). For example, as explained by Dr. 
BediHon, "[p]ersons skilled in the art would [have appreciated on March 8, 1999] that a cDNA 
microarray that contained the SEQ ID NO: 12-encoding polynucleotides would be a more useful tool 
than a cDNA microarray that did not contain the polynucleotides in connection with conducting gene 
expression monitoring studies on proposed (or actual) drugs for treating cell proliferative disorders, 
immune system disorders, infections, and neuronal disorders for such purposes as evaluating their 
efficacy and toxicity." Id. 

In support of those statements, Dr. Bedilion provided detailed explanations of how cDNA 
technology can be used to conduct gene expression monitoring evaluations, with extensive citations to 
pre-March 8, 1999 publications showing the state of the art on March 8, 1999. (Bedilion Declaration, 
\ \ 10-14). While Dr. Bedilion's explanations in paragraph 15 of his Declaration include more than 
three pages of text and six subparts (a)-(f), he specifically states that his explanations are not "all- 
inclusive." Id, For example, with respect to toxicity evaluations, Dr. Bedilion had earlier explained 
how persons skilled in the art who were working on drug development on March 8, 1999 (and for 
several years prior to March 8, 1999) "without any doubt" appreciated that the toxicity (or lack of 
toxicity) of any proposed drug was "one of the most important criteria to be evaluated in connection 
with the development of the drug" and how the teachings of the Tang et al. '404 application clearly 
include using differential gene expression analyses in toxicity studies (Bedilion Declaration, ( || 10). 

Thus, the Bedilion Declaration establishes that persons skilled in the art reading the Tang et al. 
'404 application at the time it was filed "would have wanted their cDNA microarray to have a [SEQ ID 
NO: 12-encoding polynucleotide probe] because a microarray that contained such a probe (as 
compared to one that did not) would provide more useful results in the kind of gene expression 
m.^nitnnnn dn/iuH; nuno /*n\!/\ microurravs that nersons skilled in the art have been doing since well 
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prior to March 8, 1999" (Bedilion Declaration, i 15, item (0). This, by itself, provides more than 
sufficient reason to compel the conclusion that the Tang et al. '404 application disclosed to persons 
skilled in the art at the time of its filing substantial, specific and credible real-world utilities for the 
claimed polynucleotide. 

Nowhere does the Patent Examiner address the fact that, as described on p. 33 of the Tang et 
al. k 404 application, the claimed polynucleotides can be used as highly specific probes in, for example, 
cDNA microarrays - probes that without question can be used to measure both the existence and 
amount of complementary RNA sequences known to be the expression products of the claimed 
polynucleotides. The claimed invention is not, in that regard, some random sequence whose value as a 
probe is speculative or would require further research to determine. 

Given the fact that the claimed polynucleotide is known to be expressed, its utility as a 
measuring and analyzing instrument for expression levels is as indisputable as a scale's utility for 
measuring weight. This use as a measuring tool, regardless of how the expression level data ultimately 
would be used by a person of ordinary skill in the art, by itself demonstrates that the claimed invention 
provides an identifiable, real-world benefit that meets the utility requirement. Raytheon v. Roper, 724 
F.2d 951, (Fed. Cir. 1983) (claimed invention need only meet one of its stated objectives to be useful ); 
In re Cortwright, 165 F.3d 1353, 1359 (Fed. Cir. 1999) (how the invention works is irrelevant to 
utility); MPEP § 2107 ("Many research tools such as gas chromatographs, screening assays, and 
nucleotide sequencing techniques have a clear, specific, and unquestionable utility (e.g., they are useful 
in analyzing compounds )" (emphasis added)). 

The Bedilion Declaration shows that a number of pre-March 8, 1999 publications confirm and 
further establish the utility of cDNA microarrays in a wide range of drug development gene expression 
monitoring applications at the time the Tang et al. fc 404 application was filed (Bedilion Declaration f\\ 
10-14; Bedilion Exhibits A-G). Indeed, Brown and Shalon U.S. Patent No. 5,807,522 (the Brown 
\522 patent, Bedilion Exhibit D), which issued from a patent application filed in June 1995 and was 
effectively published on December 29, 1995 as a result of the publication of a PCT counterpart 
application, shows that the Patent Office recognizes the patentable utility of the cDNA technology 
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developed in the early to mid-1990s. As explained by Dr. Bedilion, among other things (Bedilion 
Declaration, 1 12): 

The Brown '522 patent further teaches that the "[m]icroarrays of immobilized nucleic 
acid sequences prepared in accordance with the invention" can be used in "numerous" 
genetic applications, including "monitoring of gene expression" applications (see 
Bedilion Tab D at col. 14, lines 36-42). The Brown '522 patent teaches (a) monitoring 
gene expression (i) in different tissue types, (ii) in different disease states, and (iii) in 
response to different drugs, and (b) that arrays disclosed therein may be used in 
toxicology studies (see Bedilion Tab D at col. 15, lines 13-18 and 52-58 and col. 18, 
lines 25-30). 

Literature reviews published shortly after the filing of the Tang et al. '404 application describing 
the state of the art further confirm the claimed invention's utility. Rockett et al. confirm, for example, 
that the claimed invention is useful for differential expression analysis regardless of how expression is 
regulated: 

Despite the development of multiple technological advances which have recently 
brought the field of gene expression profiling to the forefront of molecular analysis, 
recognition of the importance of differential gene expression and characterization of 
differentially expressed genes has existed for many years. 

* * * 

Although differential expression technologies are applicable to a broad range of models, 
perhaps their most important advantage is that, in most cases, absolutely no prior 
knowledge of the specific genes which are up- or down-regulated is required. 

* * # 

Whereas it would be informative to know the identity and functionality of all genes 
up/down regulated by . . . toxicants, this would appear a longer term goal .... 
However, the current use of gene profiling yields a pattern of gene changes tor a 
xenobiotic of unknown toxicity which may be matched to that of well characterized 
toxins, thus alerting the toxicologist to possible in vivo similarities between the unknown 
and the standard, thereby providing a platform for more extensive toxicological 
examination, (emphasis added) 

Rockett et al.. Differential gene expression in drug metabolism and toxicology; practicalities, problems 
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In another pre-March 8, 1999 article, Lashkan et al. state explicitly that sequences that are 

merely "predicted" to be expressed (predicted Open Reading Frames, or ORFs) - the claimed 

invention in fact is known to be expressed - have numerous uses: 

Efforts have been directed toward the amplification of each predicted ORF or any 
other region of the genome ranging from a few base pairs to several kilobase pairs. 
There are many uses for these amplicons- they can be cloned into standard vectors or 
specialized expression vectors, or can be cloned into other specialized vectors such as 
those used for two-hybrid analysis. The amplicons can also be used directly by, for 
example, arraying onto glass for expression analysis , for DNA binding assays, or for 
any direct DNA assay. 

Lashkari et al., Whole genome analysis: Experimental access to all genome sequenced segments 
through larger-scale efficient oligonucleotide synthesis and PCR , 94 Proc. Nat. Acad. Sci. 8945 (Aug. 
1997) (emphasis added). 

2. The Furness Declaration 

In his Declaration, Mr. Furness explains the many reasons why a person skilled in the art who 
read the Tang et al. '404 application on March 8, 1999 would have understood that application to 
disclose the claimed polypeptide to be useful for a number of gene and protein expression monitoring 
applications, e.g., in 2-D PAGE technologies, in connection with the development of drugs and the 
monitoring of the activity of such drugs. (Furness Declaration at, e.g., [11-15]). Much, but not all, 
of Mr. Furness' explanation concerns the use of the claimed polypeptide in the creation of protein 
expression maps using 2-D PAGE. 

2-D PAGE technologies were developed during the 1980's. Since the early 1990's, 2-D 
PAGE has been used to create maps showing the differential expression of proteins in different cell 
types or in similar cell types in response to drugs and potential toxic agents. Each expression pattern 
reveals the state of a tissue or cell type in its given environment, in the presence or absence of a 
drug. By comparing a map of cells treated with a potential drug candidate to a map of cells not treated 
with the candidate, for example, the potential toxicity of a drug can be assessed. Furness Declaration 
at ( |[ fill) 
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The claimed invention makes 2-D PAGE analysis a more powerful tool for toxicology and drug 
efficacy testing. A person of ordinary skill in the art can derive more information about the state or 
states or tissue or cell samples from 2-D PAGE analysis with the claimed invention than without it. As 
Mr. Furness explains: 

In view of the Tang et al. fc 404 application, the Wilkins article, and other related pre- 
March, 1999 publications, persons skilled in the art on March 8, 1999 clearly would 
have understood the Tang et al. '404 application to disclose the SEQ ID NO: 12 
polypeptide to be useful in 2-D PAGE analyses for the development of new drugs and 
monitoring the activities of drugs for such purposes as evaluating their efficacy and 
toxicity. . . . (Furness Declaration, |10) 

* * * 

Persons skilled in the art would appreciate that a 2-D PAGE map that utilized the SEQ 
ID NO: 12 polypeptide sequence would be a more useful tool than a 2-D PAGE map 
tk.^* A\A i,fii;-7A tkic nrAtpin uviiipnrp in rnnnprtinn with rnndnctinp nrotein 
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expression monitoring studies on proposed (or actual) drugs for treating cell 
proliferative disorders, immune system disorders, infections, and neuronal disorders for 
such purposes as evaluating their efficacy and toxicity. (Furness Declaration, 112) 

Mr. Furness' observations are confirmed in the literature published before the filing of the 

patent application. Wilkins, for example, describes how 2-D gels are used to define proteins present in 

various tissues and measure their levels of expression, the data from which is in turn used in databases: 

For proteome projects, the aim of [computer-aided 2-D PAGE] analysis ... is to 
catalogue all spots from the 2-D gel in a qualitative and if possible quantitative manner, 
so as to define the number of proteins present and their levels of expression. Reference 
gel images, constructed from one or more gels, for the basis of two-dimensional gel 
databases. (Wilkins, Tab C, p. 26). 

B. The use of nucleic acids coding for proteins expressed by humans as tools for 
toxicology testing, drug discovery, and the diagnosis of disease is now "well- 
established" 

The technologies made possible by expression profiling and the DNA tools upon which they 
rely arc now well-established. The technical literature recognizxs not only the prevalence of these 
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technologies, but also their unprecedented advantages in drug development, testing and safety 
assessment. These technologies include toxicology testing, as described by Bedilion in his Declaration. 

Toxicology testing is now standard practice in the pharmaceutical industry. See, e.g., John C. 
Rockett et al., supra: 

Knowledge of toxin-dependent regulation in target tissues is not solely an academic pursuit as 
much interest has been generated in the pharmaceutical industry to harness this technology in 
the early identification of toxic drug candidates, thereby shortening the developmental process 
and contributing substantially to the safety assessment of new drugs. 

To the same effect are several other scientific publications, including Emile F. Nuwaysir et al., 

Microarravs and Toxicology: The Advent of Toxicogenomics , 24 Molecular Carcinogenesis 153 

(1999); Sandra Steiner and N. Leigh Anderson, Expression profiling in toxicology -- potentials and 

limitations , 112-13 Toxicology Letters 467 (2000). 

Nucleic acids useful for measuring the expression of whole classes ot genes are routinely 

incorporated for use in toxicology testing. Nuwaysir et al. describes, for example, a Human ToxChip 

comprising 2089 human clones, which were selected 

for their well-documented involvement in basic cellular processes as well as their responses to 
different types of toxic insult. Included on this list are DNA replication and repair genes, 
apoptosis genes, and genes responsive to PAHs and dioxin-like compounds, peroxisome 
proliferators, estrogenic compounds, and oxidant stress. Some of the other categories of genes 
include transcription factors, oncogenes, tumor suppressor genes, cyclins, kinases, 
phosphatases, cell adhesion and motility genes, and homeobox genes. Also included in this 
group are 84 housekeeping genes, whose hybridization intensity is averaged and used for signal 
normalization of the other genes on the chip. 

See also Table 1 of Nuwaysir et al. (listing additional classes of genes deemed to be of special interest 

in making a human toxicology microarray). 

The more genes that are available for use in toxicology testing, the more powerful the technique. 

"Arrays are at their most powerful when they contain the entire genome of the species they are being 

used to study/' John C. Rockett and David J. Dix, Application of DNA Arrays to Toxicology . 107 

Environ. Health Perspec.681, No. 8 (1999). Control genes are carefully selected for their stability 

across a large set of array experiments in order to best study the effect of toxicologic^ compounds. 

Sec attached email from the primary investigator on the Nuwaysir paper. Dr. Cynthia Alshan. to an 
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Incyte employee, dated July 3, 2000, as well as the original message to which she was responding, 
indicating that even the expression of carefully selected control genes can be altered. Thus, there is no 
expressed gene which is irrelevant to screening for toxicological effects, and all expressed genes have a 
utility for toxicological screening. 

In fact, the potential benefit to the public, in terms of lives saved and reduced health care costs, 
are enormous. Recent developments provide evidence that the benefits of this information are already 
beginning to manifest themselves. Examples include the following: 

In 1999, CV Therapeutics, an Incyte collaborator, was able to use Incyte gene 
expression technology, information about the structure of a known transporter gene, 
and chromosomal mapping location, to identify the key gene associated with Tangiers 
disease. This discovery took place over a matter of only a few weeks, due to the 
power of these new genomics technologies. The discovery received an award from the 
American Heart Association as one of the top 10 discoveries associated with heart 
disease research in 1999. 

In an April 9, 2000, article published by the Bloomberg news service, an Incyte 
customer stated that it had reduced the time associated with target discovery and 
validation from 36 months to 18 months, through use of Incyte' s genomic information 
database. Other Incyte customers have privately reported similar experiences. The 
implications of this significant saving of time and expense for the number of drugs that 
may be developed and their cost are obvious. 

In a February 10, 2000, article in the Wall Street Journal, one Incyte customer stated 
that over 50 percent of the drug targets in its current pipeline were derived from the 
Incyte database. Other Incyte customers have privately reported similar experiences. 
By doubling the number of targets available to pharmaceutical researchers, Incyte 
genomic information has demonstrably accelerated the development of new drugs. 

C. The use of proteins expressed by humans as tools for toxicology testing, drug 
discovery, and the diagnosis of disease is now "well-established" 

The technologies made possible by expression profiling using polypeptides are now well- 
established. The technical literature recognizes not only the prevalence of these technologies, but also 
their unprecedented advantages in drug development, testing and safety assessment. These 
technologies include toxicology testing, as described by Furness in his Declaration. 
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Toxicology testing is now standard practice in the pharmaceutical industry. See, e.g., John C. 

Rockett, et <//., Differential gene expression in drug metabolism and toxicology: practicalities, 

problems, and potential , Xenobiotica 29:655-691 (July 1999) (Reference No. 1): 

Knowledge of toxin-dependent regulation in target tissues is not solely an academic 
pursuit as much interest has been generated in the pharmaceutical industry to harness 
this technology in the early identification of toxic drug candidates, thereby shortening the 
developmental process and contributing substantially to the safety assessment of new 
drugs. ((Reference No. 1), page 656) 
To the same effect are several other scientific publications, including Emile F. Nuwaysir, et al., 

Microarrays and Toxicology: The Advent of Toxicogenomics , Molecular Carcinogenesis 24:153-159 

(1999) (Reference No. 2); Sandra Steiner and N. Leigh Anderson, Expression profiling in toxicology - 

- potentials and limitations , Toxicology Letters 1 12-13:467-471 (2000) (Reference No. 3). 

The more genes - and, accordingly, the polypeptides they encode - that are available for use 
in toxicology testing, the more powerful the technique. Control genes are carefully selected for their 
stability across a large set of array experiments in order to best study the effect of toxicological 
compounds. See attached email from the primary investigator, Dr. Cynthia Afshari to an Incyte 
employee, dated July 3, 2000, as well as the original message to which she was responding (Reference 
No. 4) Thus, there is no expressed gene which is irrelevant to screening for toxicological effects, and 
all expressed genes have a utility for toxicological screening. 

In fact, the potential benefit to the public, in terms of lives saved and reduced health care costs, 

are enormous. Recent developments provide evidence that the benefits of this information are already 

beginning to manifest themselves. Examples include the following: 

In 1999, CV Therapeutics, an Incyte collaborator, was able to use Incyte gene 
expression technology, information about the structure of a known transporter gene, 
and chromosomal mapping location, to identify the key gene associated with Tangier 
disease. This discovery took place over a matter of only a few weeks, due to the 
power of these new genomics technologies. The discovery received an award from the 
American Heart Association as one of the top 10 discoveries associated with heart 
disease research in 1999. 

In an April 9, 2000, article published by the Bloomberg news service, an Incyte 
customer stated that it had reduced the time associated with target discovery and 
validation from 36 months to 18 months, throueh use of Incvtc's Genomic information 
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database. Other Incyte customers have privately reported similar experiences. The 
implications of this significant saving of time and expense for the number of drugs that 
may be developed and their cost are obvious. 

In a February 10, 2000, article in the Wall Street Journal, one Incyte customer stated 
that over 50 percent of the drug targets in its current pipeline were derived from the 
Incyte database. Other Incyte customers have privately reported similar experiences. 
By doubling the number of targets available to pharmaceutical researchers, Incyte 
genomic information has demonstrably accelerated the development of new drugs. 

Because the Patent Examiner failed to address or consider the "well-established' 1 utilities for the 
claimed invention in toxicology testing, drug development, and the diagnosis of disease, the Examiner's 
rejections should be overturned regardless of their merit. 

D. The similarity of the claimed polypeptide to another of undisputed utility 
demonstrates utility 

Because there is a substantial likelihood that the claimed HCSRP is functionally related to the 
gpl20 receptor, a polypeptide of undisputed utility, there is by implication a substantia] likelihood that 
the claimed polypeptide and the polynucleotide that encodes it are similarly useful. Applicants need not 
show any more to demonstrate utility. In re Brana, 51 F.3d at 1567. 

It is undisputed that the claimed polypeptide shown as SEQ ID NO: 12 in the patent application 
and referred to as HCSRP- 12 shares more than 85% sequence identity over 325 amino acid residues 
with the gpl20 receptor (GENES EQ AAR32188, International Patent WO 93/01820). The gpl20 
receptor belongs to the C-lectin receptor family whose members are known to mediate cellular 
immunity in part through carbohydrate recognition on microorganisms. Members of this family have 
been shown to bind glycoproteins on the viral envelopes of human immunodeficiency virus (HIV) and 
Ebola virus (enclosed references of Curtis et aL Turville et aL and Alvarez et aL). Indeed, HCSRP- 12 
shows homology to other members of the C-lectin receptor family. The Examiner's attention is 
directed to Exhibit B, which shows a recent BLAST analysis of SEQ ID NO: 12. The top hits include 
human L-SIGN (g 13383470 and human mDC-SKJN type I isoform (gl 538306), which except for a 
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few sequence insertions, share 99.7% identity with SEQ ID NO: 12. Both L-SIGN and DC-SIGN 
are known to bind to HIV gpl20 and Ebola virus glycoproteins (enclosed references of Turville 
et aL Bashirova et al., and Alvarez et al.). The alignment of HCSRP-12 with human L-SIGN 
and mDC-SIGN corroborates the original determination of the instant application that HCSRP- 
12 was correctly assigned to the class of receptors that bind to HIV envelope glycoprotein gpl20. 

The attention of the Examiner is directed to Exhibit C which shows that SEQ ID NO: 12 
contains a C-type lectin domain from about residue S21 1 to residue K317 as determined by 
recent HMMER, MOTIFS, and BLIMPS analyses. Exhibit D shows an alignment of SEQ ID 
NO: 12 with the sequences of the gpl20 receptor (GENESEQ AAR32188), a 
membrane-associated C-type lectin that binds human immunodeficiency virus envelope 
glycoprotein gpl20 (g8572543), and L-SIGN (gl3383470), performed using the program, 
MEGALIGN version 4.05 and the CLUSTAL V algorithm. This alignment shows the presence of 
conserved residues, particularly in the region of SEQ ID NO: 12 corresponding to the lectin 
domain. In all of these proteins, a C-type lectin domain is believed to mediate carbohydrate 
recognition and binding to the HIV envelope glycoprotein, gpl20. 

The homology among these sequences is more than enough to demonstrate a reasonable 
probability that the utility of the gpl20 receptor can be imputed to the claimed invention. It is 
well-known that the probability that two unrelated polypeptides share more than 40% sequence 
homology over 70 amino acid residues is exceedingly small (Brenner et. al., Proc. Natl. Acad. 
Sci. (1998) 95:6073-78). Given homology in excess of 40% over many more than 70 amino acid 
residues, the probability that the claimed polypeptide is related to the gpl20 receptor is, 
accordingly, very high. 

It was known in the art at the time the application was filed that C-lectin receptors such as 
the gpl20 receptor could be useful for detection of virus, inhibition of viral infection, and for 
development of vaccines (enclosed references of Gcijtenbeck et al. and International patent WO 
93/01820). It was also known that infection with HIV is associated with an increased incidence 
of cancer, particularly with Kaposi's sarcoma and non-Hodgkin's lymphoma, and that gpl20 
plays a role in tumor metastasis (enclosed references of Scadden and Hodgson et al.). In 
addition, gp 1 20 induces neuronal apoptosis and neuronal injury associated with 
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the gpl20 receptor and C-lectin receptor proteins as a class, persons skilled in the art at the time the 
application was filed would have considered HCSRP-12 to be an important and valuable tool for use in 
research on cell proliferative disorders, immune system disorders, infections, and neuronal disorders. 

The Examiner must accept the Applicants' demonstration that the homology between the 
claimed invention and the gpl20 receptor demonstrates utility by a reasonable probability unless the 
Examiner can demonstrate through evidence or sound scientific reasoning that a person of ordinary skill 
in the art would doubt utility. See In re Lunger, 503 F.2d 1380, 1391-92, 183 USPQ 288 (CCPA 
1974). The Examiner has not provided sufficient evidence or sound scientific reasoning to the contrary. 

E. Objective evidence corroborates the utilities of the claimed invention 

There is, in fact, no restriction on the kinds of evidence a Patent Examiner may consider in 
determining whether a ''real-world' 1 utility exists. Indeed, "real-world" evidence, such as evidence 
showing actual use or commercial success of the invention, can demonstrate conclusive proof of utility. 
Raytheon v. Roper, 220 USPQ2d 592 (Fed. Cir. 1983); Nestle v. Eugene, 55 F.2d 854, 856, 12 
USPQ 335 (6th Cir. 1932). Indeed, proof that the invention is made, used or sold by any person or 
entity other than the patentee is conclusive proof of utility. United States Steel Corp. v. Phillips 
Petroleum Co., 865 R2d 1247, 1252, 9 USPQ2d 1461 (Fed. Cir. 1989). 

Over the past several years, a vibrant market has developed for databases containing all 
expressed genes (along with the polypeptide translations of those genes), in particular genes having 
medical and pharmaceutical significance such as the instant sequence. (Note that while the value in 
these databases is enhanced by their completeness, each sequence in them is independently valuable 
nonetheless.) The databases sold by Applicants 1 assignee, Incyte, include exactly the kinds of 
information made possible by the claimed invention, such as tissue and disease associations. Incyte 
sells its database containing the claimed sequence and millions of other sequences throughout the 
scientific community, including to pharmaceutical companies who use the information to develop new 
pharmaceuticals. 
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Both Incyte's customers and the scientific community have acknowledged that Incyte's 
databases have proven to be valuable in, for example, the identification and development of drug 
candidates. As Incyte adds information to its databases, including the information that can be generated 
only as a result of Incyte's discovery of the claimed polynucleotide and its use of that polynucleotide on 
cDNA microarrays, the databases become even more powerful tools. Thus the claimed invention adds 
more than incremental benefit to the drug discovery and development process. 

III. The Patent Examiner's Rejections Are Without Merit 

Rather than responding to the evidence demonstrating utility, the Examiner attempts to dismiss it 
altogether by arguing that the disclosed and well-established utilities for the claimed polynucleotide and 
polypeptide are not "specific and substantial" utilities. (Office Action at p. 5.) The Examiner is incorrect 
both as a matter of law and as a matter of fact. 

A. The Precise Biological Role Or Function Of An Expressed Polynucleotide or 
Polypeptide Is Not Required To Demonstrate Utility 

The Patent Examiner's primary rejection of the claimed invention is based on the ground that, 
without information as to the precise "biological role" of the claimed invention, the claimed invention's 
utility is not sufficiently specific. According to the Examiner, it is not enough that a person of ordinary 
skill in the art could use and, in fact, would want to use the claimed invention either by itself or in a 
microarray, 2-D gel or western blot to monitor the expression of genes for such applications as the 
evaluation of a drug's efficacy and toxicity. The Examiner would require, in addition, that the applicant 
provide a specific and substantial interpretation of the results generated in any given expression analysis. 

It may be that specific and substantial interpretations and detailed information on biological 
function are necessary to satisfy the requirements for publication in some technical journals, but they are 
not necessary to satisfy the requirements for obtaining a United States patent. The relevant question is 
not, as the Examiner would have it, whether it is known how or why the invention works. In re 
Cortwritfu. 165 F.3d 1353, 1359 (Fed. Cir. 1999), but rather whether the invention provides an 
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1364, 1366 (Fed. Cir. 1999). If the benefit exists, and there is a substantial likelihood the invention 
provides the benefit, it is useful. There can be no doubt, particularly in view of the Bedilion Declaration 
(at, e.g.,f\\ 10 and 15, Bedilion) and the Furness Declaration (at, e.g., f\\ 10-13), that the present 
invention meets this test. 

The threshold for determining whether an invention produces an identifiable benefit is low. 
Juicy Whip, 185 F.3d at 1366. Only those utilities that are so nebulous that a person of ordinary skill 
in the art would not know how to achieve an identifiable benefit and, at least according to the PTO 
guidelines, so-called "throwaway" utilities that are not directed to a person of ordinary skill in the art at 
all, do not meet the statutory requirement of utility. Utility Examination Guidelines, 66 Fed. Reg. 1092 
(Jan. 5,2001). 

Knowledge of the biological function or role of a biological molecule has never been required to 

show real- world benefit. In its most recent explanation of its own utility guidelines, the PTO 

acknowledged so much (66 F.R. at 1095): 

[T]he utility of a claimed DNA does not necessarily depend on the function of the 
encoded gene product. A claimed DNA may have specific and substantial utility 
because, e.g., it hybridizes near a disease-associated gene or it has gene-regulating 
activity. 

By implicitly requiring knowledge of biological function for any claimed nucleic acid, the 
Examiner has, contrary to law, elevated what is at most an evidentiary factor into an absolute 
requirement of utility. Rather than looking to the biological role or function of the claimed invention, the 
Examiner should have looked first to the benefits it is alleged to provide. 

B. Membership in a Class of Useful Products Can Be Proof of Utility 

Despite the uncontradicted evidence that the claimed polypeptide is related to the gp 1 20 
receptor, a member of the C-type lectin receptor family, whose members indisputably are useful, the 
Examiner refused to impute the utility of the gpl20 receptor to HCSRP-12. In the Office Action of 
January 15, 2003, the Patent Examiner takes the position that unless Applicants can identify which 
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particular biological function of the gpl20 receptor is possessed by HCSRP-12, utility cannot be 
imputed. 

In order to demonstrate utility by membership in a class, the law requires only that the class not 
contain a substantial number of useless members. So long as the class does not contain a substantial 
number of useless members, there is sufficient likelihood that the claimed invention will have utility, and 
a rejection under 35 U.S.C. § 101 is improper. That is true regardless of how the claimed invention 
ultimately is used and whether or not the members of the class possess one utility or many. See 
Brenner v. Manson, 383 U.S. 519, 532 (1966); Application of Kirk, 376 F.2d 936, 943 (CCPA 
1967). 

Membership in a "general" class is insufficient to demonstrate utility only if the class contains a 
sufficient number of useless members such that a person of ordinary skill in the art could not impute 
utility by a substantial likelihood. There wouid be, in that case, a substantial likelihood that the claimed 
invention is one of the useless members of the class. In the few cases in which class membership did 
not prove utility by substantial likelihood, the classes did in fact include predominately useless members. 
E.g., Brenner (man-made steroids); Kirk (same); Natta (man-made polyethylene polymers). 

The Examiner addresses HCSRP-12 as if the general class in which it is included is not the C- 
type lectin receptor family, but rather all polynucleotides or all polypeptides, including the vast majority 
of useless theoretical molecules not occurring in nature, and thus not pre-selected by nature to be 
useful. While these "general classes" may contain a substantial number of useless members, the C-type 
lectin receptor family does not. The C-type lectin receptor family is sufficiently specific to rule out any 
reasonable possibility that HCSRP-12 would not also be useful like the other members of the family. 

Because the Examiner has not presented any evidence that the C-type lectin receptor class of 
proteins has any, let alone a substantial number, of useless members, the Examiner must conclude that 
there is a "substantial likelihood" that the HCSRP-12 encoded by the claimed polynucleotides is useful. 
It follows that SEQ ID NO: 12 and SEQ ID NO:25 also are useful. 

Even if the Examiner's "common utility" criterion were correct - and it is not - the gp 1 20 
receptor would meet it. It is undisputed that known members of the C-type lectin receptor family, 
including the gp!20 receptor, junction in cellular immunity and host defense against viral infections. A 
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person of ordinary skill in the art need not know any more about how the claimed invention functions in 
cellular immunity and viral infections to use it, and the Examiner presents no evidence to the contrary. 
Instead, the Examiner makes the conclusory observation that a person of ordinary skill in the art would 
need to know whether, for example, any given gpl20 receptor functions in cellular immunity or viral 
infections. The Examiner then goes on to assume that the only use for HCSRP-12 absent knowledge 
as to how HCSRP-12 actually works is further study of HCSRP-12 itself. 

Not so. As demonstrated by Applicants, knowledge that HCSRP-12 is a C-type lectin related 
to the gpl20 receptor is more than sufficient to make it useful for the diagnosis and treatment of cell 
proliferative disorders, immune system disorders, infections, and neuronal disorders. The Examiner 
must accept these facts to be true unless the Examiner can provide evidence or sound scientific 
reasoning to the contrary. But the Examiner has not done so. 

C. Because the uses of polynucleotides encoding HCSRP in toxicology testing, 

drug discovery, and disease diagnosis are practical uses beyond mere study of 
the invention itself, the claimed invention has substantial utility. 

The Examiner rejected the claims at issue on the ground that the use of an invention as tool for 
research is not a "substantial" use. Because the Examiner's rejection assumes a substantial 
overstatement of the law, and is incorrect in fact, it must be overturned. 

There is no authority for the proposition that use as a tool for research is not a substantial utility. 

Indeed, the Patent Office has recognized that just because an invention is used in a research setting 

does not mean that it lacks utility (MPEP § 2107): 

Many research tools such as gas chromatography, screening assays, and nucleotide sequencing 
techniques have a clear, specific and unquestionable utility (e.g., they are useful in analyzing 
compounds). An assessment that focuses on whether an invention is useful only in a research 
setting thus does not address whether the specific invention is in fact "useful" in a patent sense. 
Instead, Office personnel must distinguish between inventions that have a specifically identified 
utility and inventions whose specific utility requires further research to identify or reasonably 
confirm. 

The Patent Office's actual practice has been, at least until the present, consistent with that 
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DNA ligases. These are acknowledged by the PTCTs Training Materials themselves to be useful as 
well as DNA sequences used, for example, as markers. 

Only a limited subset of research uses are not "substantial" utilities: those in which the only 
known use for the claimed invention is to be an object of further study, thus merely inviting further 
research. This follows from Brenner, in which the U.S. Supreme Court held that a process for making 
a compound does not confer a substantial benefit where the only known use of the compound was to 
be the object of further research to determine its use. Id. at 535. Similarly, in Kirk, the Court held that 
a compound would not confer substantial benefit on the public merely because it might be used to 
synthesize some other, unknown compound that would confer substantial benefit. Kirk, 376 F.2d at 
940, 945 ("What Applicants are really saying to those in the art is take these steroids, experiment, and 
find what use they do have as medicines."). Nowhere do those cases state or imply, however, that a 
material cannot be patentable if it has some other beneficial use in research. 

Such beneficial uses beyond studying the claimed invention itself have been demonstrated, in 
particular those described in the Bedilion and Furness Declarations. The claimed invention is a tool, 
rather than an object, of research. The data generated in gene expression monitoring using the claimed 
invention as a tool is not used merely to study the claimed polynucleotide itself, but rather to study 
properties of tissues, cells, and potential drug candidates and toxins. Without the claimed invention, the 
information regarding the properties of tissues, cells, drug candidates and toxins is less complete. 

Moreover, as discussed above in section II D., SEQ ID NO: 12 shares homology with other 
members of the C-lectin family that bind to viral glycoproteins. Therefore, the skilled artisan would 
have considered HCSRP to be an important and valuable tool, in particular, for use in research on cell 
proliferative disorders, immune system disorders, infections, and neuronal disorders. The claimed 
invention has numerous other uses as a research tool, each of which alone is a "substantial utility/' 
These include uses such as diagnostic assays (e.g., pages 40-44), chromosomal markers (e.g., pages 
44-45), ligand screening assays (e.g., page 33), and drug screening (page 45-46). 
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IV. By Requiring the Patent Applicant to Assert a Particular or Unique Utility, the Patent 
Examination Utility Guidelines and Training Materials Applied by the Patent 
Examiner Misstate the Law 

There is an additional, independent reason to overturn the rejections: to the extent the rejections 
are based on Revised Interim Utility Examination Guidelines (64 FR 71427, December 21, 1999), the 
final Utility Examination Guidelines (66 FR 1092, January 5, 2001) and/or the Revised Interim Utility 
Guidelines Training Materials (USPTO Website www.uspto.gov, March 1, 2000), the Guidelines and 
Training Materials are themselves inconsistent with the law. 

The Training Materials, which direct the Examiners regarding how to apply the Utility 

Guidelines, address the issue of specificity with reference to two kinds of asserted utilities: "specific" 

utilities which meet the statutory requirements, and "general" utilities which do not. The Training 

Materials define a "specific utility" as follows: 

A [specific utility] is specific to the subject matter claimed. This contrasts to general utility that 
would be applicable to the broad class of invention. For example, a claim to a polynucleotide 
whose use is disclosed simply as "gene probe" or "chromosome marker" would not be 
considered to be specific in the absence of a disclosure of a specific DNA target. Similarly, a 
general statement of diagnostic utility, such as diagnosing an unspecified disease, would 
ordinarily be insufficient absent a disclosure of what condition can be diagnosed. 

The Training Materials distinguish between "specific" and "general" utilities by assessing 
whether the asserted utility is sufficiently "particular," i.e., unique (Training Materials at p. 52) as 
compared to the "broad class of invention." (In this regard, the Training Materials appear to parallel 
the view set forth in Stephen G. Kunin, Written Description Guidelines and Utility Guidelines , 82 
J.P.T.O.S. 77, 97 (Feb. 2000) ("With regard to the issue of specific utility the question to ask is 
whether or not a utility set forth in the specification is particular to the claimed invention.")). 

Such "unique" or "particular" utilities never have been required by the law. To meet the utility 
requirement, the invention need only be "practically useful/ 1 NattcL 480 F.2d 1 at 1397, and confer a 
"specific benefit" on the public. Brenner, 383 U.S. at 534. Thus, incredible "throwaway" utilities, such 
as trying to "patent a transgenic mouse by saying it makes great snake food," do not meet this standard. 
Karen Hall, Genomic Warfare , The American Lawyer 68 (June 2000) (quoting John Doll, Chief of the 
Biotech Section of USPTO) 
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This does not preclude, however, a general utility, contrary to the statement in the Training 
Materials where "specific utility" is defined (page 5). Practical real-world uses are not limited to uses 
that are unique to an invention. The law requires that the practical utility be "definite, " not particular. 
Montedison, 664 F.2d at 375. Applicant is not aware of any court that has rejected an assertion of 
utility on the grounds that it is not "particular" or "unique" to the specific invention. Where courts have 
found utility to be too "general," it has been in those cases in which the asserted utility in the patent 
disclosure was not a practical use that conferred a specific benefit. That is, a person of ordinary skill in 
the art would have been left to guess as to how to benefit at all from the invention. In Kirk, for 
example, the CCPA held the assertion that a man-made steroid had "useful biological activity" was 
insufficient where there was no information in the specification as to how that biological activity could be 
practically used. Kirk, 376 F.2d at 941. 

The fact that an invention can have a particular use does not provide a basis for requiring a 
particular use. See Brana, supra (disclosure describing a claimed antitumor compound as being 
homologous to an antitumor compound having activity against a "particular" type of cancer was 
determined to satisfy the specificity requirement). "Particularity" is not and never has been the sine qua 
nan of utility; it is, at most, one of many factors to be considered. 

As described supra, broad classes of inventions can satisfy the utility requirement so long as a 
person of ordinary skill in the art would understand how to achieve a practical benefit from knowledge 
of the class. Only classes that encompass a significant portion of nonuseful members would fail to meet 
the utility requirement. Supra § II. B. 2 (Montedison, 664 F.2d at 374-75). 

The Training Materials fail to distinguish between broad classes that convey information of 
practical utility and those that do not, lumping all of them into the latter, unpatentable category of 
"general" utilities. As a result, the Training Materials paint with too broad a brush. Rigorously applied, 
they would render unpatentable whole categories of inventions that heretofore have been considered to 
be patentable and that have indisputably benefitted the public, including the claimed invention. See 
supra § II. B. Thus the Training Materials cannot be applied consistently with the law. 
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V. To the Extent the Rejection of the Patented Invention under 35 U.S.C. § 112, First 
Paragraph, Is Based on the Improper Rejection for Lack of Utility under 35 U.S.C. 
§ 101, it Must Be Reversed. 

The rejection set forth in the Office Action is based on the assertions discussed above, i.e., that 
the claimed invention lacks patentable utility. To the extent that the rejection under § 1 12, first 
paragraph, is based on the improper allegation of lack of patentable utility under § 101, it fails for the 
same reasons. 

Written description rejections under 35 U.S.C. § 112, first paragraph 

CI aims 21, 23, 26, 27, 28, 30, 31, 35, and 37 have been rejected under the first paragraph of 
35 U.S.C. 1 12 for alleged lack of an adequate written description. This rejection is respectfully 
traversed. 

As a preliminary matter, claim 21 (b) has been amended to recite "a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to the amino acid sequence of SEQ ED 
NO: 12, wherein said polypeptide binds to human immunodeficiency virus glycoprotein gp!20 ." Claim 
21 (c) has been amended to recite a biologically active fragment of a polypeptide having the amino acid 
sequence of SEQ ID NO: 12, wherein said fragment binds to human immunodeficiency virus 
glycoprotein gp!20 . By these amendments, Applicants expressly do not disclaim equivalents of the 
invention which could include polypeptides and fragments having biological activities other than binding 
to human immunodeficiency virus glycoprotein gp!20. Applicants are amending the claims solely to 
obtain expeditious allowance of the instant application. Support for these amendments to claim 21 can 
be found in the specification, for example, at Table 2, which lists SEQ ID NO: 12 as a homolog of a 
known gpl20 receptor (GENESEQAAR32 188) that binds gpl20 (International Patent WO 
93/01820). As mentioned above (See Section II D.), SEQ ID NO: 12 shares homology with other C- 
Icctin proteins known to bind gp 1 20 and contains a C-typc lectin domain that may mediate 
carbohydrate recognition of viral envelope glycoproteins, including gpl20. 

The requirements necessary to fulfill the written description requirement of 35 U.S.C. 1 12, first 
paragraph, arc well established by case law. 
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the art that, as of the filing date sought, he or she was in possession of the invention. 
The invention is, for purposes of the "written description" inquiry, whatever is now 
claimed. Vas-Cath, Inc. v. Mahurkar, 19 USPQ2d 1111,1117 (Fed. Cir. 1991) 

Attention is also drawn to the Patent and Trademark Office's own "Guidelines for Examination 
of Patent Applications Under the 35 U.S.C. Sec. 1 12, para. V\ published January 5, 2001, which 
provide that : 

An applicant may also show that an invention is complete by disclosure of sufficiently 
detailed, relevant identifying characteristics 42 which provide evidence that applicant was 
in possession of the claimed invention, 43 i.e., complete or partial structure, other physical 
and/or chemical properties, functional characteristics when coupled with a known or 
disclosed correlation between function and structure, or some combination of such 
characteristics. 44 What is conventional or well known to one of ordinary skill in the art 
need not be disclosed in detail. 45 If a skilled artisan would have understood the inventor 
to be in possession of the claimed invention at the time of filing, even if every nuance of 
the claims is not explicitly described in the specification, then the adequate description 
requirement is met. 46 

Thus, the written description standard is fulfilled by both what is specifically disclosed and what 
is conventional or well known to one skilled in the art. 

SEQ ID NO: 12 and SEQ ID NO:25 are specifically disclosed in the application (see, for 
example, pages 21-22). Variants of SEQ ID NO: 12 and SEQ ED NO:25 are described, for example, 
at page 22, line 23 through page 23, line 4. Incyte clones in which the nucleic acids encoding the 
human HCSRP were first identified and libraries from which those clones were isolated are described, 
for example, at Table 1 of the Specification. Chemical and structural features of SEQ ID NO: 12 are 
described, for example, in Table 2. Given SEQ ID NO: 12 and SEQ ID NO:25, one of ordinary skill in 
the art would recognize naturally-occurring variants of SEQ ID NO: 12 having 90% sequence identity to 
SEQ ID NO: 12 and naturally-occurring variants of SEQ ID NO:25 having 90% sequence identity to 
SEQ ID NO:25. Accordingly, the Specification provides an adequate written description of the recited 
polynucleotide and polypeptide sequences. 
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A. The Specification provides an adequate written description of the claimed 
"variants" of SEQ ID NO:12 and SEQ ID NO:25. 

The Office Action has further asserted that the claims are not supported by an adequate written 

description because "it cannot be established that a representative number of species have been 
disclosed to support the genus claim" (Office Action, page 10). 

Such a position is believed to present a misapplication of the law. 

1. The present claims specifically define the claimed genus through the recitation 
of chemical structure 

Court cases in which "DNA claims" have been at issue (which are hence relevant to claims to 
proteins encoded by the DNA) commonly emphasize that the recitation of structural features or 
chemical or physical properties are important factors to consider in a written description analysis of 
such claims. For example, in Fiers v. Revel 25 USPQ2d 1601, 1606 (Fed. Cir. 1993), the court 
stated that: 

If a conception of a DNA requires a precise definition, such as by structure, formula, 
chemical name or physical properties, as we have held, then a description also requires 
that degree of specificity. 

In a number of instances in which claims to DNA have been found invalid, the courts have 

noted that the claims attempted to define the claimed DNA in terms of functional characteristics without 

any reference to structural features. As set forth by the court in University of California v. Eli Lilly 

and Co., 43 USPQ2d 1398, 1406 (Fed. Cir. 1997): 

In claims to genetic material, however, a generic statement such as "vertebrate insulin 
cDNA" or "mammalian insulin cDNA," without more, is not an adequate written 
description of the genus because it does not distinguish the claimed genus from others, 
except by function. 

Thus, the mere recitation of functional characteristics of a DNA, without the definition of 
structural features, has been a common basis by which courts have found invalid claims to DNA. For 
example, in Lilly. 43 USPQ2d at 1407, the court found invalid for violation of the written description 
requirement the following claim of U.S. Patent No. 4,652,525: 

I \ recombinant nlasmid rcn!ici.ib!e m procarvotic host containing within its nucleotide 




Docket No.: PF-0636 USN 

sequence a subsequence having the structure of the reverse transcript of an mRN A of a 
vertebrate, which mRNA encodes insulin. 

In tiers, 25 USPQ2d at 1603, the parties were in an interference involving the 
following count: 

A DNA which consists essentially of a DNA which codes for a human fibroblast 
interferon-beta polypeptide. 

Party Revel in the Fiers case argued that its foreign priority application contained an adequate 
written description of the DNA of the count because that application mentioned a potential method for 
isolating the DNA. The Revel priority application, however, did not have a description of any particular 
DNA structure corresponding to the DNA of the count. The court therefore found that the Revel 
priority application lacked an adequate written description of the subject matter of the count. 

Thus, in Lilly and Fiers, nucleic acids were defined on the basis of functional characteristics 
and were found not to comply with the written description requirement of 35 U.S.C. §112; i.e., "an 
mRNA of a vertebrate, which mRNA encodes insulin" in Lilly, and "DNA which codes for a human 
fibroblast interferon-beta polypeptide" in Fiers. In contrast to the situation in Lilly and Fiers, the 
claims at issue in the present application define polynucleotides or polypeptides specifically in terms of 
chemical structure, rather than on functional characteristics. For example, the "variant language" of 
independent claims 21 and 30 recite chemical structure to define the claimed genus: 

21. An isolated polypeptide selected from the group consisting of: ... 
b) a polypeptide comprising a naturally occurring amino acid sequence at 
least 90% identical to the amino acid sequence of SEQ ID NO: 12... 

30. An isolated polynucleotide selected from the group consisting of: ... 
b) a polynucleotide comprising a naturally occurring polynucleotide 

sequence at least 90% identical to the polynucleotide sequence of SEQ 

IDNO:25... 

From the above it should be apparent that the claims of the subject application are 
fundamentally different from those found invalid in Lilly and Fiers. The subject matter of the present 
claims is defined in terms of the chemical structures of SEQ ID NO: 1 2 and SEQ ID NO:25. In the 
present case, there is no reliance merely on a description of functional characteristics of the 
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characteristics. Moreover, if such functional recitations were included, it would add to the structural 
characterization of the recited polynucleotides or polypeptides. The polynucleotides or polypeptides 
defined in the claims of the present application recite structural features, and cases such as Lilly and 
Fiers stress that the recitation of structure is an important factor to consider in a written description 
analysis of claims of this type. By failing to base its written description inquiry u on whatever is now 
claimed," the Office Action failed to provide an appropriate analysis of the present claims and how they 
differ from those found not to satisfy the written description requirement in Lilly and Fiers 

2. The present claims do not define a genus which is "highly variant" 

Furthermore, the claims at issue do not describe a genus which could be characterized as 
"highly variant." Available evidence illustrates that the claimed genus is of narrow scope. 

In support of this assertion, the Examiner's attention is directed to the enclosed reference by 
Brenner et al. ("Assessing sequence comparison methods with reliable structurally identified distant 
evolutionary relationships," Proc. Natl. Acad. Sci. USA (1998) 95:6073-6078). Through exhaustive 
analysis of a data set of proteins with known structural and functional relationships and with <90% 
overall sequence identity, Brenner et al. have determined that 30% identity is a reliable threshold for 
establishing evolutionary homology between two sequences aligned over at least 150 residues. 
(Brenner et al., pages 6073 and 6076.) Furthermore, local identity is particularly important in this case 
for assessing the significance of the alignments, as Brenner et al. further report that >40% identity over 
at least 70 residues is reliable in signifying homology between proteins. (Brenner et al., page 6076.) 

The present application is directed, inter alia, to human cell surface receptor proteins related to 
the amino acid sequence of SEQ ID NO: 12. In accordance with Brenner et al, naturally occurring 
molecules may exist which could be characterized as human cell surface receptor proteins and which 
have as little as 40% identity over at least 70 residues to SEQ ID NO: 12. The "variant language" of 
the present claims recites, for example, polypeptides or polynucleotides encoding "a naturally-occurring 
amino acid sequence having at least 90% sequence identity to the sequence of SEQ ID NO: 12" (note 
that SEQ ID NO: 12 has 325 amino acid residues). This variation is far less than that of all potential 
human cell surface receptor proteins related to SEQ ID NO: 12. i.e.. those human cell surface receptor 
proteins having as little as 40%- identity over at least 70 residues to SEQ ID NO: 12. 
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3. The state of the art at the time of the present invention is further advanced than 
at the time of the Lilly and Fiers applications 

In the Lilly case, claims of U.S. Patent No. 4,652,525 were found invalid for failing to comply 
with the written description requirement of 35 U.S.C. §112. The '525 patent claimed the benefit of 
prionty of two applications, Application Serial No. 801,343 filed May 27, 1977, and Application Serial 
No. 805,023 filed June 9, 1977. In the Fiers case, party Revel claimed the benefit of priority of an 
Israeli application filed on November 21, 1979. Thus, the written description inquiry in those case was 
based on the state of the art at essentially at the "dark ages" of recombinant DNA technology. 

The present application has a priority date of March 8, 1999. Much has happened in the 
development of recombinant DNA technology in the 22 or more years from the time of filing of the 
applications involved in Lilly and Fiers and the present application. For example, the technique of 
polymerase chain reaction (PCR) was invented. Highly efficient cloning and DNA sequencing 
technology has been developed. Large databases of protein and nucleotide sequences have been 
compiled. Much of the raw material of the human and other genomes has been sequenced. With these 
remarkable advances one of skill in the art would recognize that, given the sequence information of 
SEQ ID NO: 12 and SEQ ID NO:25, and the additional extensive detail provided by the subject 
application, the present inventors were in possession of the claimed polynucleotide and polypeptide 
variants at the time of filing of this application. 

4. Summary 

The Office Action failed to base its written description inquiry "on whatever is now claimed." 
Consequently, the Action did not provide an appropriate analysis of the present claims and how they 
differ from those found not to satisfy the written description requirement in cases such as Lilly and 
Fiers. In particular, the claims of the subject application are fundamentally different from those found 
invalid in Lilly and Fiers. The subject matter of the present claims is defined in terms of the chemical 
structure of SEQ ID NO: 12 or SEQ ID NO:25. The courts have stressed that structural features are 
important factors to consider in a written description analysis of claims to nucleic acids and proteins. In 
addition, the genus of polynucleotides or polypeptides defined by the present claims is adequately 
described, as evidenced by Brenner et al and consideration of the claims of the '740 patent involved in 
Lilly. Furthermore, there have been remarkable advances in the state oi the art since the Lilly and 
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Fiers cases, and these advances were given no consideration whatsoever in the position set forth by the 
Office Action. Therefore, withdrawal of the rejections under U.S.C. § 1 12, first paragraph is 
respectfully requested. 

Rejection under 35 U.S.C. § 112, second paragraph 

Claims 21-31 and 35-37 have been rejected under 35 U.S.C. § 1 12, second paragraph, as 
allegedly being "indefinite for failing to particularly point out and distinctly claim the subject matter 
which applicant regards as the invention." (Office Action, page 10). In particular, the Examiner states 
that it is not clear what is meant by the term "naturally occurring". Applicants respectfully disagree and 
traverse the rejection. 

The M.P.E.P. provides guidelines to Examiners for rejections under 35 U.S.C. § 112, second 

paragraph as follows: 

...a full explanation of the deficiency of the claims should be supplied. Whenever 
possible, identify the particular term(s) or limitation(s) which render the claim(s) 
indefinite and state why such term or limitation renders the claim indefinite. If the scope 
of the claimed subject matter can be determined by one having ordinary skill in the art, 
a rejection using this form paragraph would not be appropriate. 
(M.P.E.P. § 706.03(d)) 

Therefore claims must be examined on the basis of whether one having ordinary skill in the art 
would be able to determine the scope of the claim and, if a rejection is made, reasons must be provided 
why the claim is indefinite. Applicants submit that the Examiner has not provided any reasons or 
evidence why the cited phrase is indefinite and/or why one having ordinary skill in the art could not 
determine the scope of the claim. For this reason alone, the rejection is improper and should be 
withdrawn. 

The term "naturally occurring" defines the amino acid and nucleotide sequences encompassed 
by the claims. "Naturally occurring" means present in or produced by nature. The use of the term 
"naturally-occurring" distinguishes a sequence that occurs in nature from synthetic or engineered 
sequences that do not. One of skill in the art would understand the meaning of the term "naturally- 
occurring" within the context of the claims. Applicants are claiming those polynucleotides and 
polypeptides comprising sequences that already exist in nature. Applicants wish to clarify that the claim 
language does not preclude making such sequences synthetically. Claims 21-27, 30, 31, 35. and 36 
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polypeptides. Applicants have the right to claim a subgenus of variants of SEQ ID NO: 12 and SEQ ID 
NO:25, those comprising naturally occurring sequences, as opposed to all variant sequences that can 
be made synthetically. The scope of these claims is clear. Therefore, withdrawal of the rejections 
under U.S.C. § 1 12, second paragraph is respectfully requested. 

Rejections under 35 U.S.C. § 102 

Claims 21(d) and 31 have been canceled, thus the rejections under 35 U.S.C. § 102 with 
respect to these claims and their dependent claims are moot. 
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CONCLUSION 

In light of the above amendments and remarks. Applicants submit that the present application is 
fully in condition for allowance, and request that the Examiner withdraw the outstanding rejections. 
Early notice to that effect is earnestly solicited. 

If the Examiner contemplates other action, or if a telephone conference would expedite 
allowance of the claims, Applicants invite the Examiner to contact Applicants 7 Attorney at 
(650) 855-0555. 

Applicants believe that no fee is due with this communication. However, if the USPTO 
determines that a fee is due, the Commissioner is hereby authorized to charge Deposit Account No. 
09-0108. 



Respectfully submitted, 
INCYTE GENOMICS, INC. 

Date: ' f • . - ■ - ' , ' , 

Jenny Buofibinder 
Reg. No.'48,588 

Direct Dial Telephone: (650) 843-7212 



Date: /S /'b>UJ 



I I 

XJ/2 



Cathleen M. Rocco 
Reg. No. 46,172 

Direct Dial Telephone: (650) 845-4587 



3160 Porter Drive 
Palo Alto. California 94304 
Phone: (650) 855-0555 
Fax: (650) 849-8886 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

IN THE SPECIFICATION: 

The following sentence was added immediately after the title of the application on page 1 of the 
specification: 

This application claims benefit under 35 U.S.C. g 1 19(e) of provisional application 
60/183, 048, filed on November 12. 1999. provisional application 60/183.045 filed on December 7. 
1998. and provisional application 60/123.404 filed on March 8. 1999. 

The sixth column of Table 2 at page 63 of the application, beginning with the row for SEQ ID 
NO: 12, was replaced with the following rewritten column: 



Homologous Sequences 
Non-CD4 glycoprotein 
gpl20 receptor 
GENESEO AA R32188 
DP prostanoid receptor 
g940379 
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IN THE CLAIMS: 

Claim 31 has been canceled. 

Claims 21, 22, 25, 29, 30, and 36 have been amended as follows: 

21. (Once Amended) An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence [selected from the group consisting 
of SEQ ID NO:l-131 of SEP ID NO:12 . 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to [an] the amino acid sequence [selected from the group consisting of SEQ 
ID NO:l-13] of SEP ID NO: 12, wherein said polypeptide binds to human 
immunodeficiency virus glycoprotein gp!20, and 

c) a biologically active fragment of a polypeptide having [an] the amino acid sequence 
[selected from the group consisting of SEQ ID NO: 1-13] of SEP ID NO: 12, wherein 
said fragment binds to human immunodeficiency virus glycoprotein gp!20. [, and 

d) an immunogenic fragment of a polypeptide having [an] the amino acid sequence 
[selected from the group consisting of SEQ ID NO: 1-13] of SEP ID NO: 12 .1 

22. (Once Amended) An isolated polypeptide of claim 21 comprising [an] the amino acid 
sequence [selected from the group consisting of SEQ ID NO: 1-13] of SEP ID NO: 12 . 

25. (Once Amended) An isolated polynucleotide of claim 24 comprising a polynucleotide 
sequence [selected from the group consisting of SEQ ID NO: 14-26] of SEQ ID NO:25 . 

2 ( >. (Once Amended) A method of claim 28, wherein the polypeptide comprises [an) the 
amino acid sequence [selected from the group consisting of SEQ ID NO: 1-13] of SEQ ID NO: 12 . 



30. (Once Amended) An isolated polynucleotide selected from the group consisting of: 
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a) a polynucleotide comprising a polynucleotide sequence [selected from the group 
consisting of SEQ ID NO: 14-26] of SEP ID NO:25 , 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to [a] the polynucleotide sequence [selected from the group consisting of SEQ 
ID NO: 14-26] of SEP ID NO:25 , 

c) a polynucleotide complementary to a polynucleotide of a), 

d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

36. (Once Amended) A composition of claim 35, wherein the polypeptide comprises [an] the 
amino acid sequence [selected from the group consisting of SEQ ID NO: 1-13] of SEP ID NP: 12 . 
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DECLARATION OF DR. TOD BEDILION 
UNDER 37 C.F.R.S 1.132 



I, TOD BEDILION, a citizen of the United States, residing at 132 Winding Way, San 
Carlos, California, declare that: 



1. I was employed by Incyte Genomics, Inc. (hereinafter "Incyte") as a Director 
of Corporate Development until May 11, 2001. I am currently under contract to be a Consultant to 
Incyte Genomics, Inc. 



2. In 1996, 1 received a Ph.D. degree in Cell, Molecular and Development 
Biology from UCLA. I had previously received, in 1988, a B.S. degree in biology from UCLA. 

Upon my graduation from UCLA, I became, in April 1996, the first employee of 
Synteni, Inc. (hereinafter "SyntcnD. I was a Research Director at Synteni from April 1996 until 
Syntcni was acquired by Incyte in early 1998. 



Docket No.: PF-0636 USN 

I understand that Synteni was founded in 1994 by T. Dan Shalon while he was a 
graduate student at Stanford University. I further understand that Synteni was founded for the purpose 



of commercially exploiting certain "cDNA rmcroarray" technology that was being worked on at 
Stanford in the early to mid-1990s. That technology, which I will sometimes refer to herein as the 
"Stanford-developed cDNA microarray technology", was the subject of Dr. Shalon's doctoral thesis at 
Stanford. I understand and believe that Dr. P.O. Brown was Dr. Shalon's thesis advisor at Stanford. 

During the period beginning before I was employed by Synteni and ending upon its 
acquisition by Incyte in early 1998, 1 understand Synteni was the exclusive licensee of the Stanford- 
developed cDNA microarray technology, subject to any right that the United States government may 

have with respect to that technolopv In pnrlv 1QQ8 i im,Wct.,„^ i — j ■ ., 

aj . -.. j . ,^ w , . w.. U v.i jiuuu in^^iv^ acLjuucu i ignis uiiuei trie 

Stanford-developed cDNA microarray technology as part of its acquisition of Synteni. 

I understand that at the time of the commencement of my employment at Synteni in 
April 1996, Synteni's rights with respect to the Stanford-developed cDNA technology included rights 
under a United States patent application that had been filed June 7, 1995 in the names of Drs. Brown 
and Shalon and that subsequently issued as United States Patent No. 5,807,522 (the Brown '522 
patent). In December 1995, the subject matter of the Brown '522 patent was published based on a 
PCT patent application that had also been filed in June 1995. The Brown '522 patent (and its 
corresponding PCT application) describes the use of the Stanford-developed cDNA technology in a 
number of gene expression monitoring applications, as will be discussed more fully below. 

Upon Incyte's acquisition of Synteni, I became employed by Incyte. From early 1998 
until late 1999, I was an Associate Research Director at Incyte. In late 1999. I was promoted to the 
position of Director, Corporate Development. 

I have been aware of the Stanford-developed cDNA microarray technology since 
shortly before I commenced my employment at Synteni. While I was employed by Synteni. virtually all 
(if not all ) of my work efforts (as well as the work efforts of others employed by Synteni ) were directed 
io the further development and commercial exploitation of that cDNA microarrav technology. Bv the 



Docket No.: PF-0636 USN 

end of 1997, those efforts had progressed to the point that I understand Incyte agreed to pay at least 
about $80 million to acquire Synteni. Since I have been employed by Incyte, I have continued to work 



on the further development and commercial exploitation of the cDNA microarray technology that was 
first developed at Stanford in the early to mid-1990s. 

3. I have reviewed the specification of a United States patent application that I 
understand was filed on May 8, 2001 in the names of Tang et al. and was assigned Serial No. 
09/831,458 (hereinafter "the Tang et al. '458 application"). Furthermore, I understand that this United 
States patent application claimed priority to United States Provisional Patent Application Serial No. 
60/123.404 filed on March 8 I99Q rhprpinoftor "th a T.^r, <** „i ma^ — -m ^ 

-- -, lung uui. tut au^uccuiuil 1 lie OEA^ LL* 

NO:12-encoding polynucleotides were described in the Tang et al. '404 application. (Note that the 
sequences of SEQ ID NO: 12 and SEQ ID NO:25 disclosed in the Tang et al. '458 application are 
identical to the sequences referred to as SEQ ID NO:5 and SEQ ID NO: 11, respectively, in the Tang 
et al. '404 application). My remarks herein will therefore be directed to the Tang et al. '404 patent 
application, and March 8, 1999, as the relevant date of filing. In broad overview, the Tang et al. '404 
specification pertains to certain nucleotide and amino acid sequences and their use in a number of 
applications, including gene expression monitoring applications that are useful in connection with 
(a) developing drugs (e.g., the diagnosis of inherited and acquired genetic disorders, expression 
profiling, toxicology testing, and drug development with respect to cancer, an immunopathology, a 
neuropathology, and the like), and (b) monitoring the activity of drugs for purposes relating to evaluating 
their efficacv and toxicitv. 

4. I understand that (a) the Tang et al. '404 application contains claims that arc 
directed to isolated and purified polynucleotides having the sequences disclosed in the Tang et al. '458 
application as SEQ ID NO: 12-cncoding polynucleotides, for example SEQ ID NO:25 (hereinafter "the 
SEQ ID NO: 12-cncodmg polynucleotides"), and (b) the Patent Examiner has rejected those claims on 
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the grounds that the specification of the Tang et al. k 404 application does not disclose a substantial, 
specific and credible utility for the claimed SEQ ID NO: 12-encoding polynucleotides. 1 further 
understand that whether or not a patent specification discloses a substantial, specific and credible utility 
for its claimed subject matter is properly determined from the perspective of a person skilled in the art 
to which the specification pertains at the time of the patent application was filed. In addition, I 
understand that a substantial, specific and credible utility under the patent laws must be a ^real-world" 
utility. 

5. I have been asked (a) to consider with a view to reaching a conclusion (or 

conclusions) as to whether or not I agree with the Patent Examiner's position that the Tang et al. '404 

application does not disclose a substantial, specific and credible ^real-world" utility for the claimed 

SEQ ID NO- 1 2-encoding polynucleotides, and (b) to state and explain the bases for any conclusions I 

reach. I have been informed that, in connection with my considerations, I should determine whether or 

not a person skilled in the art to which the Tang et al. '404 application pertains on March 8, 1999 

would have concluded that the Tang et al. 4 404 application disclosed, for the benefit of the public, a 

specific beneficial use of the SEQ ID NO:12-encoding polynucleotides in their then available and 

disclosed form. I have also been informed that, with respect to the "real-world" utility requirement, the 

Patent and Trademark Office instructs its Patent Examiners in Section 2107 of the Manual of Patent 

Examining Procedure, under the heading "I. 'Real-World Value' Requirement": 

"Many research tools such as gas chromatographs, screening assays, and 
nucleotide sequencing techniques have a clear, specific and unquestionable utility (e.g., 
they are useful in analyzing compounds). An assessment that focuses on whether an 
invention is useful only in a research setting thus does not address whether the specific 
invention is in fact 'useful' in a patent sense. Instead, Office personnel must distinguish 
between inventions that have a specifically identified utility and inventions whose 
specific utility requires further research to identify or reasonably confirm." 

6. I have considered the matters set forth in paragraph 5 of this Declaration and 
have concluded that, contrary to the position I understand the Patent Examiner has taken, the 
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of its tiling a number of substantial, specific and credible real-world utilities for the claimed SEQ ID 
NO: 12-encoding polynucleotides. More specifically, persons skilled in the art on March 8, 1999 
would have understood the Tang et al. '404 application to disclose the use of the SEQ ID NO: 1 2- 
encoding polynucleotides in a number of gene expression monitoring applications that were well-known 
at that time to be useful in connection with the development of drugs and the monitoring of the activity 
of such drugs. I explain the bases for reaching my conclusion in this regard in paragraphs 7-16 below. 

7. In reaching the conclusion stated in paragraph 6 of this Declaration, I 
considered (a) the specification of the Tang et al. '404 application, and (b) a number of published 
articles and patent documents that evidence gene expression monitoring techniques that were 
well-known before the March 8, 1999 filing date of the Tang et al. '404 application. The published 

articles nnH nntent Hnrntnpntc T 

- j * w^/I HHUVyl Ul V. 

(a) Schena, M., Shalon, D., Heller, R„ Chai, A., Brown, P.O., and Davis, 
R W ' Parallel hu man genome analysis: Microarrav-based expression monitoring of 1000 genes . Proc. 
Natl. Acad. Sci. USA, 93, 10614-10619 (1996) (hereinafter "the Schena 1996 article") (copy 
annexed at Tab A); 

(b) Schena, M., Shalon, D., Davis, R.W., Brown, P.O., Quantitative 
Monitoring of Gene Expression Patterns with a Complementary DNA Microarrav . Science, 270, 467- 
470 (1995) (hereinafter "the Schena 1995 article") (copy annexed at Tab B); 

(c) Shalon and Brown PCT patent application WO 95/35505 titled 
"Method and Apparatus For Fabricating Microarrays Of Biological Samples," filed on June 16, 1995, 
and published on December 28, 1995 (hereinafter "the Shalon PCT application") (copy annexed at 
Tab C): 

(d) Brown and Shalon U.S. Patent No. 5,807.522, corresponding to the 
Shalon PCT application, titled "Methods For Fabricating Microarrays Of Biological Samples." filed on 
June 7. 1995 and issued on September 15, 1998 (hereinafter "the Brown '522 patent") (copy annexed 
at Tab D): 
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(e) DeRisi, J., Penland, L., and Brown, P.O. (Group 1); Bittner, ML., 
Meltzer, P.S., Ray, M, Chen, Y., Su, Y.A., and Trent, J.M. (Group 2), Use of a cDNA microarrav to 
analyse gene expression patterns in human cancer . Nat. Genet., 14(4), 457-460 (1996) (hereinafter 
"the DeRisi article") (copy annexed at Tab E); 

(0 Shalon, D., Smith, S.J., and Brown, P.O., A DNA Microarrav System 
for Analyzing Complex DNA Samples Using Two-color Fluorescent Probe Hybridization , Genome 
Res., 6(7), 639-645 (1996) (hereinafter "the Shalon article") (copy annexed at Tab F); 

(g) Heller, R.A., Schena, M., Chai A., Shalon, D., Bedilion, T., Gilmore, 

J., Woolley, D.E., and Davis R.W., Discovery and analysis of inflammatory disease-related genes using 
cDNA microarravs . Proc. Natl. Acad. Sci. USA, 94, 2150-2155 (1997) (hereinafter "the Heller 
article ,v )(copy annexed at Tab G); 

(h) Sainbiook, J., Fnisch, E.F., Maniatis, T., Moiecuiar Cioning, A 
Laboratory Manual , pages 7.37 and 7.38, Cold Spring Harbor Press (1989) (hereinafter "the 
Sambrook Manual") (copy annexed at Tab H); 

8. Many of the published articles and patent documents I considered (i.e., at least 
items (a)-(g) identified in paragraph 7) relate to work done at Stanford University in the early and mid- 
1990s with respect to the development of cDNA microarrays for use in gene expression monitoring 
applications under which Synteni became exclusively licensed. As I will discuss, a person skilled in the 
art who read the Tang et al. 4 404 application on March 8, 1999 would have understood that 
application to disclose the SEQ ID NO: 1 2-encoding polynucleotides to be useful for a number of gene 
expression monitoring applications, e.g., as a probe for the expression of that specific polynucleotide in 
cDNA microarrays of the type first developed at Stanford. 

9. Turning more specifically to the Tang et al. '404 specification, the SEQ ID 
NO:25 polynucleotide (referred to as SEQ ID NO: 1 1 in the Tang et al. k 404 application) is shown at p. 
1 1 as one of 12 sequences under the heading "Sequence Listing/ 1 The Tang ct al. l 404 specification 
specifically teaches that the "the invention features isolated and substantially purified polynucleotides 
that encode HRP" (Tang et al. '404 application at p. 4). It further teaches that (a) the identity of the 
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SEQ ID NO:25 polynucleotide was determined from a spleen tissue cDNA library (SPLNNOT09) 
(Tang et al. '404 application, Table 1), (b) the SEQ ID NO:25 polynucleotide encodes for the human 
receptor protein (HRP) shown as SEQ ID NO: 12 (Tang et al. '404 application at Table 2), and (c) 
northern analysis of SEQ ID NO:25 shows its expression predominantly in cDNA libraries associated 
with hematopoietic/immune, gastrointestinal, cardiovascular, and reproductive tissues (Tang et al. '404 
application at Table 3). 

The Tang et al. '404 application discusses a number of uses of the SEQ ID NO: 12-encoding 
polynucleotides in addition to their use in gene expression monitoring applications. I have not fully 
evaluated these additional uses in connection with the preparation of this Declaration and do not 
express any views in this Declaration regarding whether or not the Tang et al. '404 specification 
discloses these additional uses to be substantial, specific and credible real-world utilities of the SEQ ID 

NO: 1 2-encnHinp nnlvrmrlpnttH^c Pnnc 
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et al. '404 application focuses on the portions of the application that relate to the use of the SEQ ID 

NO: 12-encoding polynucleotides in gene expression monitoring applications. 

10. The Tang '404 application discloses that the polynucleotide sequences 
disclosed therein, including the SEQ ID NO: 12-encoding polynucleotides, are useful as probes in 
microarrays. It further teaches that the microarrays can be used "to monitor the expression level of large 
numbers of genes simultaneously" for a number of purposes, including "to develop and monitor the 
activities of therapeutic agents" (Tang '404 application at p. 33, lines 18-22). 

In the paragraph immediately following the Tang et al. '404 teachings described in the 
preceding paragraph of this Declaration, the Tang '404 application teaches that microarrays can be 
prepared using the previously mentioned cDNA microarray technology developed at Stanford in the 
early to mid-1990s. In this connection, the Tang '404 application specifically cites to the Schena 1996 
article identified in item (a) of paragraph 7 of this Declaration (Tang '404 application at p. 32; supra. 
paragraph 7). 

The Schena 1 996 article is one of a number of documents that were published prior to 
the March S. 1999 filing date of the Tang et al. '404 application that describes the use of the Stanford- 
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developed cDNA technology in a wide range of gene expression monitoring applications, including 
monitoring and analyzing gene expression patterns in human cancer. In view of the Tang et al. fc 404 
application, the Schena 1996 article, and other related pre-March, 1999 publications, persons skilled in 
the art on March 8, 1999 clearly would have understood the Tang et al. '404 application to disclose the 
SEQ ID NO:12-encoding polynucleotides to be useful in cDNA microarrays for the development of 
new drugs and monitoring the activities of drugs for such purposes as evaluating their efficacy and 
toxicity, as explained more fully in paragraph 15 below. 

With specific reference to toxicity evaluations, those of skill in the art who were 
working on drug development in March, 1999 (and for many years prior to March, 1999) without any 
doubt appreciated that the toxicity (or lack of toxicity) of any proposed drug they were working on was 
one of the most important criteria to be considered and evaluated in connection with the development 
of the drug. They would have understood at that time that good drugs arc not only potent, they arc 
specific. This means that they have strong effects on a specific biological target and minimal effects on 
all other biological targets. Ascertaining that a candidate drug affects its intended target, and 
identification of undesirable secondary effects (i.e., toxic side effects), had been for many years among 
the main challenges in developing new drugs. The ability to determine which genes are positively 
affected by a given drug, coupled with the ability to quickly and at the earliest time possible in the drug 
development process identify drugs that are likely to be toxic because of their undesirable secondary 
effects, have enormous value in improving the efficiency of the drug discovery process, and are an 
important and essential part of the development of any new drug. Accordingly, the teachings in the 
Tang et al. '404 application, in particular regarding use of the SEQ ID NO: 12-encoding 
polynucleotides in differential gene expression analysis and in the development and the monitoring of the 
activities of drugs, clearly includes toxicity studies and persons skilled in the art who read the Tang et al. 
L 404 application on March 8, 1999 would have understood that to be so. 

1 I . The Schena 1996 article was not the first publication that described the use of 
the cDNA microarray technique developed at Stanford to monitor quantitatively gene expression 
patterns. More than a year earlier (i.e., in October 1995), the Schena 1995 article, titled "Quantitative 
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Monitonng of Gene Expression Patterns with a Complementary DNA Microarray", was published (see 
Tabs A and B). 

12. As previously discussed (supra, paragraphs 2 and 7), in the mid-1990s patent 
applications were filed in the names of Drs. Shalon and Brown that described the Stanford-developed 
cDNA microarray technology. The two patent documents (i.e., the Shalon PCT application and the 
Brown '522 patent) annexed to this Declaration at Tabs C and D evidence information that was 
available to the public regarding the Stanford-developed cDNA microarray technology before the 
March 8, 1999 filing date of the Tang et al. '404 application. 

The Shalon PCT patent application, which was published in December 1995, contains 
virtually the same (if not exactly the same) specification as the Brown '522 patent. Hence, the Brown 

'522 nafent rlisrlosnrp w;i<: in pffart q\«I1oKIo tn tUo nukllo .>n tU« r>,„ u.. inn; li:...:. . .i . 
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of the Shalon PCT application(see Tabs C and D). For the sake of convenience, I cite to and discuss 

the Brown '522 specification below on the understanding that the descriptions in that specification were 

published as of the December 28, 1995 publication date of the Shalon PCT application. 

The Brown '522 patent discusses, in detail, the utility of the Stanford-developed cDNA 

microarrays in gene expression monitoring applications. For example, in the "Summary Of The 

Invention" section, the Brown '522 patent teaches (see Tab D, col. 4, line 52-col. 5, line 8): 

Also forming part of the invention is a method of detecting 
differential expression of each of a plurality of genes in a first cell type, 
with respect to expression of the same genes in a second cell type. In 
practicing the method, there is first produced fluorescent-labeled 
cDNAs from mRNAs isolated from two cells types, where the cDNAs 
from the first and second cell types are labeled with first and second 
different flourescent reporters. 

A mixture of the labeled cDNAs f rom the two cell types is 
added to an array of polynucleotides representing a plurality of known 
genes derived from the two cell types, under conditions that result in 
hybridization of the cDNAs to complementary-sequence 
polynucleotides in the array. The array is then examined by 
fluorescence under fluorescence excitation conditions in which (i) 
polynucleotides in the arrav that are hvbridized predominantly to 



Docket No.: PF-0636 USN 

cDNAs derived from one of the first or second cell types give a distinct 
first and second fluorescence emission color, respectively, and (ii) 
polynucleotides in the array that are hybridized to substantially equal 
numbers of cDNAs derived from the first and second cell types give a 
distinct combined fluorescence emission color, respectively. The 
relative expression of known genes in the two cell types can then be 
determined by the observed fluorescence emission color of each spot. 

The Brown "522 patent further teaches that the "[m]icroarrays of immobilized nucleic 
acid sequences prepared in accordance with the invention" can be used in "numerous" genetic 
applications, including "monitoring of gene expression' 1 applications (see Tab D at col. 14, lines 36-42). 
The Brown l 522 patent teaches (a) monitoring gene expression (i) in different tissue types, (ii) in 
different disease states, and (iii) in response to different drugs, and (b) that arrays disclosed therein may 
be used in toxicology studies (see Tab D at col. 15, lines 13-18 and 52-58 and col. 18, lines 25-30). 

13. Also pertinent to my considerations underlying this Declaration is the DeRisi 
article, published in December 1996. The DeRisi article describes the use of the Stanford-developed 
cDNA microarray technology "to analyze gene expression patterns in human cancer" (see Tab E at, 
e.g., p. 457). The DeRisi article specifically indicates, consistent with what was apparent to persons 
skilled in the art in December 1996, that increasing the number of genes on the cDNA microarray 
permits a "more comprehensive survey of gene expression patterns," thereby enhancing the ability of 
the cDNA microarray to provide "new and useful insights into human biology and a deeper 
understanding of the gene pathways involved in the pathogenesis of cancer and other diseases" (see 
Tab Eat p. 458). 

14. Other pre-March, 1999 publications further evidence the utility of the cDNA 
microarrays first developed at Stanford in a wide range of gene expression monitoring applications (see, 
e.g., the Shalon and the Heller articles at Tabs F and G). By no later than the March 1 997 publication 
of the Heller article, these publications showed that employees of Synteni (i.e., James Gilmore and 
myself) had used the cDNA microarrays in specific gene expression monitoring applications (sec Tab 
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The Heller article states that the results reported therein "successfully demonstrate the 
use of the cDNA microarray system as a general approach for dissecting human diseases" (Tab G at 
p. 2150). Among other things, the Heller article describes the investigation of "1000 human genes that 
were randomly selected from a peripheral human blood cell library" and "[t]heir differential and 
quantitative expression analysis in cells of the joint tissue. . . to demonstrate the utility of the microarray 
method to analyze complex diseases by their pattern of gene expression" (see Tab G at pp. 2150 et 
seq.). 

Much of the work reported on in the Heller article was done in 1996. That article, 
therefore, evidences how persons skilled in the art were readily able, well prior to March 8, 1999, to 
make and use cDNA microarrays to achieve highly useful results. For example, as reported in the 
Heller article, a cDNA microarray that was used in some of the highly successful work reported on 
therein w:is made from 1,000 genes randomly selected from a human blood cell library. 

15. A person skilled in the art on March 8, 1999, who read the Tang et al. '404 
application, would understand that application to disclose the SEQ ID NO: 12-encoding 
polynucleotides, for example, SEQ ID NO:25, to be highly useful as probes for the expression of that 
specific polynucleotide in cDNA microarrays of the type first developed at Stanford. For example, the 
specification of the Tang et al. '404 application would have led a person skilled in the art in March, 
1999 who was using gene expression monitoring in connection with working on developing new drugs 
for the treatment of cell proliferative disorders, immune system disorders, infections, and neuronal 
disorders to conclude that a cDNA microarray that contained the SEQ ID NO: 12-encoding 
polynucleotides would be a highly useful tool and to request specifically that any cDNA microarray that 
was being used for such purposes contain the SEQ ID NO: 12-encoding polynucleotides. Persons 
skilled in the art would appreciate that cDNA microarrays that contained the SEQ ID NO: 12-cncodmg 
polynucleotides would be a more useful tool than cDNA microarrays that did not contain the 
polynucleotides in connection with conducting gene expression monitoring studies on proposed (or 
actual) drugs for treating cell proliferative disorders, immune system disorders, infections, and neuronal 
disorders tor such purposes as evaluating their efficacy and toxicity. 
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I discuss in more detail in items (a)-(g) below a number of reasons why a person skilled 
in the art, who read the Tang et al. 4 404 specification in March, 1999, would have concluded based on 
that specification and the state of the art at that time, that the SEQ ID NO: 12-encoding polynucleotides 
would be a highly useful tool for inclusion in cDNA microarrays for evaluating the efficacy and toxicity 
of proposed drugs for treating cell proliferative disorders, immune system disorders, infections, and 
neuronal disorders , as well as for other evaluations: 

(a) The Tang et al. '404 application teaches the SEQ ID NO: 12-encoding 
polynucleotides to be useful as probes in cDNA microarrays of the type first developed at Stanford. It 
also teaches that such cDNA microarrays are useful in a number of gene expression monitoring 
applications, including "developing and monitoring the activity of therapeutic agents [i.e., drugs]" (see 
paragraph 10, supra). 

vu; Ljy iviai^n, i?yy, uic oLuiuui u-uc vciupcu ui^iN/\ inicioanay lecnnoiogy 
was a well known and widely accepted tool for use in a wide range of gene expression monitoring 
applications. This is evidenced, for example, by numerous publications describing the use of that 
cDNA technology in gene expression monitoring applications and the fact that, for over a year, the 
technology had provided the basis for the operations of an up-and-running company (Synteni), with 
employees, that was created for the purpose of developing and commercially exploiting that technology 
(see paragraphs 2, 8 and 10-14, supra). The fact that Incyte agreed to purchase Synteni in late 1997 
for an amount reported to be at least about $80 million only serves to underscore the substantial 
practical and commercial significance, in 1997, of the cDNA microarray technology first developed at 
Stanford (see paragraph 2, supra). 

(c) The pre-March, 1999 publications regarding the cDNA microarray 
technology first developed at Stanford that I discuss in this Declaration repeatedly confirm that, 
consistent with the teachings in the Tang et al. k 4()4 application, cDNA microarrays are highly useful 
tools for conducting gene expression monitoring applications with respect to the development of drugs 
and the monitoring of their activity. Among other things, those pre-March, 1999 publications confirmed 
that cDNA microarrays (i) were useful for monitoring gene expression responses to different drugs (see 
paragraph 12, supra). <ii) were useful in analyzing gene expression patterns in human cancer, with 
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increasing the number of genes on the cDNA microarray enhancing the ability of the cDNA microarray 
to provide useful information (see paragraph 13, supra), and (iii) were a valuable tool for use as part of 
a "general approach for dissecting human diseases" and for "analyzing] complex diseases by their 
pattern of gene expression" (see paragraph 14, supra). 

(d) Based on my own extensive work for a company whose business was the 
development and commercial exploitation of cDNA microarray technology for more than two years 
prior to the March, 1999 filing date of the Tang et al. '404 application, I have first-hand knowledge 
concerning the state of the art with respect to making and using cDNA microarrays as of March 8, 
1999 (see paragraphs 2 and 14, supra). Persons skilled in the art as of that date would have (a) 
concluded that the Tang et al. '404 application disclosed cDNA microarrays containing the SEQ ID 
NO:12-encoding polynucleotides to be useful, and (b) readily been able to make and use such 



(e) The Tang et al. '404 specification contains a number of teachings that would 
lead persons skilled in the art on March 8, 1999 to conclude that a cDNA microarray that contained 
the SEQ ID NO:12-encoding polynucleotides would be a more useful tool for gene expression 
monitoring applications relating to drugs for treating cell proliferative disorders, immune system 
disorders, infections, and neuronal disorders than a cDNA microarray that did not contain the SEQ ID 
NO:12-encoding polynucleotides. Among other things, the Tang et al. '404 specification teaches that 
the identity of the SEQ ID NO:25 polynucleotide was determined from a spleen tissue cDNA library 
(SPLNNOT09) (Tang et al. '404 application, Table 1). Moreover, northern analysis of SEQ ID 
NO:25 shows its expression predominantly in cDNA libraries associated with hematopoietic/immune, 
gastrointestinal, cardiovascular, and reproductive tissues. (Tang et al. '404 application at Table 3). 
(Sec paragraph 9, supra). 

Moreover, the Tang et al. *4()4 specification teaches that the HRP protein having the amino 
acid sequence of SEQ ID NO: 12 shares homology with known functional proteins. HRP is a member 
of the human receptor protein family. In particular, SEQ ID NO: 12 shares homology with the 
sequence of the non-CD4 glycoprotein gpl20 receptor (GENESEQ AAR32188). (Tang et al. ^404 
application, at Tabic 2). 
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(f) Persons skilled in the art on March 8, 1999 would have appreciated (i) that 
the gene expression monitoring results obtained using a cDNA microarray containing a probe to a 
sequence selected from the group consisting of SEQ ID NO:12-encoding polynucleotides would vary, 
depending on the particular drug being evaluated, and (ii) that such varying results would occur both 
with respect to the results obtained from the probe described in (i) and from the cDNA microarray as a 
whole (including all its other individual probes). These kinds of varying results, depending on the 
identity of the drug being tested, in no way detracts from my conclusion that persons skilled in the art 
March 8, 1999, having read the Tang et al. '404 specification, would specifically request that any 
cDNA microarray that was being used for conducting gene expression monitoring studies on drugs for 
treating cell proliferative disorders, immune system disorders, infections, and neuronal disorders (e.g., 
toxicology study or any efficacy study of the type that typically takes place in connection with the 

development of a drug) contain anv nnp nf thp <!FO tn wo n ar,™^™ t,~i.,~..„i — i. . 
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Persons skilled in the art on March 8, 1999 would have wanted their cDNA microarray to have a 
probe as described in (i) because a microarray that contained such a probe (as compared to one that 
did not) would provide more useful results in the kind of gene expression monitoring studies using 
cDNA microarrays that persons skilled in the art have been doing since well prior to March 8, 1999. 

The foregoing is not intended to be an all-inclusive explanation of all my reasons for 
reaching the conclusions stated in this paragraph 15, and in paragraph 6, supra. In my view, however, 
it provides more than sufficient reasons to justify my conclusions stated in paragraph 6 of this 
Declaration regarding the Tang et al. '404 application disclosing to persons skilled in the art at the time 
of its filing substantial, specific and credible real-world utilities for the SEQ ID NO:12-encoding 
polynucleotides. 



16. Also pertinent to my considerations underlying this Declaration is the fact that the 
Tang et al. -404 disclosure regarding the uses of the SEQ ID NO:25 polynucleotide for gene 
expression monitoring applications is not limited to the use of that polynucleotide as a probe in 
microarrays. For one thing, the Tang ct al. '404 disclosure regarding the hybridization technique used 
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in gene expression monitoring applications is broad (Tang et al. '404 application at, e.g.. p. 4, lines 3-8). 

In addition, the Tang et al. "404 specification repeatedly teaches that the 
polynucleotides described therein (including the polynucleotide of SEQ ID NO:25) may desirably be 
used as probes in any of a number of long established "standard" non-microarray techniques, such as 
Northern analysis, for conducting gene expression monitoring studies. See, e.g.: 

(a) Tang et al. '404 application at p. 4, lines 27-29 ("[NJorthern analysis is 
indicative of the presence of nucleic acids encoding HRP in a sample, and thereby correlates with 
expression of the transcript from the polynucleotide encoding HRP"); 

(b) Tang et al. '404 application at p. 32, lines 7-1 1 ("The polynucleotide 
sequences encoding HRP may be used in Southern or northern analysis, dot blot, or other 
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assavs: and in rriicrn;irniv<; ntiliyina flmHc nr ticcn^c noti^t^ *^ ~u — ^ imn • _ 
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Such qualitative or quantitative methods are well known in the art"); 

(c) Tang et al. '404 application at p. 32, lines 21-29 ("In order to provide a 
basis for the diagnosis of a disorder associated with expression of HRP, a normal or standard profile 
for expression is established. This may be accomplished by combining body fluids or cell extracts 
taken from normal subjects, either animal or human, with a sequence, or a fragment thereof, encoding 
HRP, under conditions suitable for hybridization or amplification. Standard hybridization may be 
quantified by comparing the values obtained from normal subjects with values from an experiment in 
which a known amount of a substantially purified polynucleotide is used. Standard values obtained in 
this manner may be compared with values obtained from samples from patients who are symptomatic 
for a disorder. Deviation from standard values is used to establish the presence of a disorder"); and 

(d) Tang et al. '404 application at p. 37. lines 18-21 ("Northern analysis is a 
laboratory technique used to detect the presence of a transcript of a gene and involves the hybridization 
of a labeled nucleotide sequence to a membrane on which RNAs from a particular cell type or tissue 
have been bound. (See. e.g., Sambrook. supra , ch. 7: Ausubel, 1995. supra , ch. 4 and 16.)" ). 

The "Sambrook et al." reference cited in item (d) immediately above is a reference that 
w as well known lo persons skilled in the art in March. 1999. A copy of pages from that reference 
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manual, which was published in 1989, is annexed to this Declaration at Tab H. The attached pages 
from the Sambrook manual provide an overview of northern analysis and other membrane-based 
technologies for conducting gene expression monitoring studies that were known and used by persons 
skilled in the art for many years prior to the March 8, 1999 filing date of the Tang et al. '404 
application. 

A person skilled in the art on March 8, 1999, who read the Tang et al. '404 
specification, would have routinely and readily appreciated that the SEQ ID NO:12-encoding 
polynucleotides disclosed therein would be useful as a probe to conduct gene expression monitoring 
analyses using northern analysis or any of the other traditional membrane-based gene expression 
monitoring techniques that were known and in common use many years prior to the filing of the Tang et 
al. '404 application. For example, a person skilled in the art in March, 1999 would have routinely and 
readily appreciated that the SEQ ID NO:12-cncoding polynucleotides would be a useful tool in 
conducting gene expression analyses, using the northern analysis technique, in furtherance of (a) the 
development of drugs for the treatment of cell proliferative disorders, immune system disorders, 
infections, and neuronal disorders, and (b) analyses of the efficacy and toxicity of such drugs. 
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17. I declare further that all statements made herein of my own knowledge are true 
and that all statements made herein on information and belief are believed to be true; and further, that 
these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, and that willful false statements may jeopardize the validity 
of this application and any patent issuing thereon. 



Tod Bedilion 



Signed at Redwood City, California 
this day of April, 2003 
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Sb j c t : 


6 1 


LAGVT A/AT TVmVSKVP SSL SOFOSFOnATYnNLTOT.KAAVGFLSFKSF^ 


12 0 


Query : 


93 


AAVGELPEKSKLQEI YQELTRLKJ^AVGELPEKSKLQEIYQELTRLKJUWGELPEKSKLQE 


152 






AAVGELPEKSKLQEIYQELTRLf'AAVGELPEKSf'LQEIYQELTRLflAAVGELPEKSKLQE 




Sb j c t : 


121 


AAV^FT.PFKS^^iOEIYOELTRLtlAAVGELPFK^KT.OFTYOELTRLtlAAVGFLPFKSKr.OF 

jT^uli V ^ JJ J—j j— I X i_j UkJ * \,X_J J— i -L X 1— * J—J X 1 \ -i_J I Jiii V V_J J—j -I — 1 XT X_j 1 w. fcJ X v Xj X_j -X, X \£ J—J X_J X X \. J — J 1 ^Tli i V \JJ X_j XJ XT X_j I v I lXj L—i 


180 


Query : 


153 


IYQELT RLFAAVGELPDQSKQQQIYQELTDLKTAFER 


189 






iyqelt +lkaavgelpdqskqqqiyqeltdlktafer 




Sb j C t : 


181 


T YOFT.TFT.FAAVGFT.PFKSKTjOFT YOFT.TOLFAAVGFLPDOSKOOOTYOFT.TnT.FTAFFR 

_L X \w X_j XJ X jLj JLJ I uTji V \_J J_j J / J_ J_j I\iJK J_J Xj _L X X_j J J ± V^j J 1 1 iJTjTV V \_J J—j J 1 XT J— / X *. \^ _L 1 v^y l_j 1 J X X_/ X.J I v. X li. X X_j I v 


24 0 


Query : 


190 


lcrhcp?;dwtffqgncyfmsnsqrnwhdsvtacqevraqlwiktaeeqnflqlqtsrsn 


249 






lcrhcpkdwtffqgncyfmsnsqrnwhdsvtacqevraqlwiktaeeqnflqlqtsrsn 






241 


T.rRHrPT-'nWTFFnnNrYFM^N^ORNWHn^VTArriFVR AOT AA/T KTAFFOMFT.DT.OTCIR ^N 


3 ID 0 


Query : 


250 


rfs^glsdlnqegtwqwvdgsplspsfqrywnsgepnnsgnedcaefsgsgwndnrcdv 


309 






RFSWMGIjSDLNQEGTWQWVDGSPLSPSFQRYWNSGEPNNSGNEDCAEFSGSGWNDNRCDV 




Sbjct : 


301 


rfswmglsdlnqegtwqwdgsplspsfqrywnsgepnnsgnedcaefsgsgwndnrcdv 


360 


Query : 


310 


DNYWICKKPAACFRDE 32 5 








DNYWICKKPAACFRDE 




Sbjct : 


361 


DNYWICKKPAACFRDE 3 76 





> gipj«jDUD rruju - b itjriM z type i isororm L^omo sapiens j 
Length =3 99 

Score - 621 bits (1585), Expect = e-177 

Identities = 324/399 (81%), Positives - 325/399 (81%), Gaps - 74/399 (18%) 

Query: 1 MSDSKEPRVQQLGLL GCLGHGALVLQLLSFML 32 

MSDSKEPRVQQLGLL GCLGHGALVLQLLSFML 
Sbjct : 1 MSDSKEPRVQQLGLLEEDPTTSGIRLFPRDFQFQQIHGHKSSTGCLGHGALVLQLLSFML 60 

Query: 33 LAGVLVAILVQVSKVPSSLSQEQSEQDAIYQNLTQLP;AAVGELSEKSKLQEIYQELTQLK 92 

LAGVLVAILVQVSKVPSSLSQEQSEQDAIYQNLTQLKlAAVGELSEKSKLQEIYQELTQLK 
Sbjct : 61 LAGVLVAILVQVSKVPSSLSQEQSEQDAIYQNLTQLf^VAVGELSEKSKLQEIYQELTQLf; 12 0 

Query: 93 AAVGELPEKSKL QEIYQELTRLPTAAVGELPEKSKLQE 129 

aavgelpekskl qeiyqeltrlkaavgelpeksklqe 
sbjct : 121 aavgelpeksklqeiyqeltrlpiaavgelpeksklqeiyqeltrlkaavgelpeksklqe 180 

Query: 130 IYQELTR LKAAVGELPEKSKLQEIYQELTRLKAAVGE 166 

IYQELTR LKAAVGELPEKSKLQEIYQELT+LKAAVGE 
Sbjct : 181 lYQELTRLFJVAVGELPEP:SKLQEIYQELTELKJVAVGELPEKSKLQEIYQELTQLf:AAVGE 24 0 

Query: 167 LPDQSKQQQIYQELTDLKTAFERLCRHCPF.DWTFFQGNCYFMSNSQRNWHDSVTACQEVR 22 6 

LPDQSKOQQIYQELTDLKTAFERLCPHCPF.DWTFFOGNCYFMSNSQR^HDSVTACC'EVF. 
Sbjct : 241 LPDQSKiX'QIYQELTDLKT^ 3 00 

Query: 2 2 7 AQLWI K rAEEQrjFLQLQTSRSNRFSWMGLSL'LNQErlTWQVA/riGSPLSPSFQRWNSGEP 2 86 

A(«)LWIKTAEEQNFLQLQTSF.SrjRFSWMGLSDLNQEGTWQWVDGSPLSPSFC!RYWNSGEP 
Sbjct : >.U1 AQLVYIKTAEEQNFLQLQTSRSNRFSWMGLSI'LNQK^ 3 60 

Query : 2 87 NIJSGNEDCAEFSGS G WN D N R 1 2 DVD N YW ICYrW AA C F F. D E 32 5 

rnjSGNEDCAEF3GSGWNDNF.CDVDNYWICKKPAACFF.DE 
Sbjct: 361 NN 3 G N E DC A E F S G S GWN DNRC D VDN YW I C K K P AAC F R D E 399 
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BLAST2 Results 



http://patents.incv te.convSOOO/cgi-bin/ScqScrvcr/SeqScrv. 



Score = 621 bits (1585), Expect = e-177 

Identities = 324 '399 (81%), Positives = 325/399 (81%), Gaps - 74/399 (18%) 

Query: 1 MSDSKEPRVQQLGLL GCLGHGALVLQLLSFML 3 2 

MSDSKEPRVQQLGLL GCLGHGALVLQLLSFML 
Sbjct : 1 MSDSKEPRVQQLGLLEEDPTTSGIRLFPRDFQFQQIHGHKSSTGCLGHGALVLQLLSFML 60 

Query : 33 LAGVLVAILVQVSKVPSSLSQEQSEQDAIYQNLTQLPAAVGELSEKSKLQEIYQELTQLP' 92 
LAGVLVAILVQVSKVPSSLSQEQSEQDAIYQNLTQLf^AVGELSEKSKLQEIYQELTQLK 

sbjct : 61 lagvlvailvqvsp;vpsslsqeqseqdaiyqnltqlplaavgelseksklqeiyqeltqlp: 120 

Query: 93 AAVGELPEKSKL QEIYQELTRLP'AAVGELPSKSKLQE 129 

AAVGELPEKSKL QEIYQELTRLKAAVGELPEKSKLQE 
Sbjct : 121 AAVGELPEt;SKLQEIYQELTRLKAAVGELPEf:SKLC>EIYQELTRLKAAVGELPEKSKLQE 180 

Query: 130 IYQELTR LKAAVGELPEKSKLQEI YQELTRLfAAVGE 166 

IYQELTR LKAAVGELPEKSKLQEIYQELT+LKAAVGE 
Sbjct : 181 IYQELTRLKAAVGELPEKSKLQEIYQELTELKAAVGELPEKSKLQEIYQELTQLKAAVGE 240 

Query: 167 lpdqsp:q>qqiyqeltdlp;taferlcrhcpkdwtffqgncyfmsnsqrnwhdsvtacqevr 22 6 

lpdqskqqqiyqeltdlp:taferlcrhcpp:dwtffQ)Gncyfmsnsqrnwhdsvtacqevr 
Sbjct : 241 lpdqskqqqiyqeltdlktaferlcrhcpkdwtffqgncyfmsnsqrnwhdsvtacqevr 300 

query: 227 aqlwiktaeeqnflqlqtsrsnrfswmglsdlnqegtwqwvdgsplspsfqrywnsgep 2 86 
aqlwi?;taeeqnflqlqtsrsnrfswmglsdlnqegtwqwvdgsplspsfqrywnsgep 

Sbjct: 301 AOLWIKTAEEQNFLQLQTSRSNRFSWMGIjSDLNQEGTWQWVDGSPLSPSFQRYWNSGEP 360 

Query: 2 87 NNSGNEDCAEFSGSGWNDNRCDVDNYWICKKPAACFRDE 32 5 

NNSGNEDCAEFSGSGWNDNRCDVDNYWICKKPAACFRDE 
Sbjct: 361 NNSGNEDCAEFSGSGWNDNRCDVDNYWICKKPAACFRDE 399 



> g!20847 97 probable mannose binding C-type lectin DC-SIGNR [Homo 
saipiens ] 
Length = 399 

Score - 617 bits (1574), Expect = e-175 

Identities - 323/399 (80%), Positives = 324/399 (80%), Gaps - 74/399 (18*) 

MSDSKEPRVQQLGLL GCLGHGALVLQLLSFML 3 2 

MSDSKEPRVQQLGLL GCLGHGALVLQLLSFML 
MSDSKEPRVQQLGLLEEDPTTSGIRLFPRDFQFQQIHGHKSSTGCLGHGALVLQLLSFML 6 0 



LAGVLVAILVQVSKVPSSLSQEQSEQDAIYQNLTQLFIAAVGELSEKSKLQEIYQELTQLK 



AAVGELPEKSKL QEIYQELTRLKAAVGELPEKSKLQE 12 9 

AAVGELPEKSKL QEIYQELTRLKAAVGELPEKSKLQE 



IYQELTR L } 'AAV G E L P E K S K L Q E I Y Q E L T + L KAA VG E 

I YQELTRLFAAVGELPEKSKLQEI YQELTELFAAVGELPEKSKLQEI YQELTQLtAAVGE 24 0 

LFTH:)SKr»OC)IYr)ELTDLF.TAFERLCRHCPKDVvTFFC»GNCYFMSMSC'RNV\ : HDSVTACC)EVR 226 
LFTtQSKOQQlVrjELTDLKTAFERLCRHCPKDWTFFQGNCYFMSNSQRNWHDSVTACQEVR 



7 7. 7 AQT VVTKTAFFQnFLQLQTSPSNRFSWMGLSF'T ■NQEG'TWQWvT'GSFLSPSr QRYWNSGtF' 2 86 

T T 77 7 T I/ 1 T 1 7 1 . U 1 T7 1 f i>T U* T r"iT ! T 1 "D CJMD T C T-,7T T * C T CT'T * T-~! irr'T'T.T,- - r T.-.7T TV -. C I ■ T Q T"5 C IT f^. "D V7.-,7>,7 C t" 1 T TT' 



Query : 


1 


Sbjct : 


1 


Query : 


33 


Sbjct : 


61 


Query : 


93 


Sbjct : 


: 2i 


Query : 


13 0 


Sbjct : 


181 


Query : 


167 


Sbjct : 


241 


Que r y : 


7 7. 7 
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BLAST2 Results http://patcnts.incyte.com:8000/cgi-bin/ScqScrver/ScqServ, 

Sbjct: 361 NNSGNEDXAEFSGSGWNDNRCDVDN^-WICKKPAACFRDE 399 

>g! 53 83614 SDC-SIGN2 type I isoforrn [Homo sapiens] 
Length - 332 

Score = 583 bits (1487), Expect = e~165 

Identities = 293/332 (88%), Positives = 300/332 (90%), Gaps = 7/332 (2%) 

Query: 1 MSDSKEPRVQQLGLLG CLGHGALVLQLLSFMLLAG VLVAILVQVSKVPSSLSQ 53 

MSDSKEPRVQQLGLL L + F + G + +L VSKVPSSLSQ 

Sbjct : 1 MSDSKEPRVQQLGLLEEDPTTSGIRLFPRDFQFQQIHGHKSSTVPFLLGPVSKVPSSLSQ 6 0 

Query: 54 EQSEQDAIYQNLTOLfCAAVGELSEKSKLQEIYQELTQLFAAVGELPEfZSKLQEIYQELTR 113 

EQSEQDAIYQNLTQLfLAAVGELSEKSKLQEIYQEIjTQLflAAVGELPEPvSKLQEIYQELTR 
Sbjct : 61 EQSEQDAIYONLTQLf^AAVGELSEKSf:LQEIYQELTQLFAAVGELPEf:SKLQEIYQELTR 12 0 

Query: 114 LKAAVGELPEKSKLQEIYQELTRLFAAVGELPEP'.SKLQEIYQELTRLf^AVGELPDQSKQ 17 3 

LFAAVGELPEHSKLQEIYQELTRLFAAVGELPEKSKLQEIYQELT LKAAVGELP++ SK 
Sbjct: 121 LFJVAVGELPEKSKLQEIYQELTRLi'LAAVGELPEKSKLQEl YQELTELfCAAVGELPEKSPvL 130 

Query: 17 4 QQIYQELTDL?;TAFERLCRHCPKDWTFFQGNCYFMSNSQRNWHDSVTACQEVRAQLWIK 23 3 

Q+IYQELT LK A ERLCRHCPKDWTFFQGNCYFMSNSORNWHDSVTACQEVRAQLWIK 
Sbjct: 181 QEIYQELTQLKAAVERLCRHCPKDWTFFQGNCYFMSNSQRNWHDSVTACQEVRAQLWIfC 240 

Query: 2 34 TAEEQNFLQ»LQTSRSNRFSWMGLSDLNQEGTWQWVDGSPLSPSFQRYWNSGEPNNSGNED 2 93 
TAEEONFLOLOTSRRNRFSWMnT.c:nT.lsjn^r^TTATnvAr\m^QDT CDCi?ADVT,iMpPT7nmTPPMrn 

Sbjct: 241 TAEEQNFLQLQTSRSNRFSWMGLSDLNQEGTWQWVDGSPLSPSFQRYWNSGEPNNSGNED 300 

Query: 294 CAEFSGSGWNDNRCDVDNYWICKKPAACFRDE 325 

CAEFSGSGWNDNRCDVDNYWICKKPAACFRDE 
Sbjct: 301 CAEFSGSGWNDNRCDVDNYWICKKPAACFRDE 332 

> g8572543 membrane-associated lectin type-C [Homo sapiens] 
Length = 404 

Score = 532 bits (1356), Expect = e-150 

Identities = 281/392 (71%), Positives = 301/392 (76%), Gaps = 71/392 (18%) 

MSDSKEPRVQQLGLL GCLGHGALVLQLLSFMLLAGVLVAI 4 0 

MSDSKEPR+QQLGLL GCLGHG LVLQLLSF LLAG + 
MSDSKEPRLQQLGLLEEEQLRGLGFRQTRGYKSLAGCLGHGPLVLQLLSFTLLAG L 56 

LVQVSKVPSSLSQEQSEQDAIYQNLTQLtlAAVGELSEKSKLQEIYQELTQLI^AAVGELPE 100 
LVQVSKVPSS+SQEQS QDAI YQNLTQLFJ^VGELSEKSKLQEIYQELTQLICAAVGELPE 
LVQVSKVPSSISQEQSRQDAIYQNLTQLFAAVGELSEKSKLQEIYQELTQLP^AVGELPE 116 

KSKL QEIYQELTRL 114 

KSKL QEIYQELTRL 
K S KLQ E I Y Q E LT RI , KAA VG ELPEKSKLQEIYQEL TW I > KAAYG E L P E K S KMQi E I Y Q EL 7 R L 1 7 6 

FAAVGELPEKSKLQEI YQELTRLP2AAVGELPEKSKLQEI YQ 17.1 
KAAVGELPEP'SK QEI YQELTRLKAAVGELPEKSK QEI YQELTRLEAAVGELP+ + SKO<"i 
KAAVGELPEKSKQQEIYQELTRLKAAVGELPEKSKQi^ 2 3 h 

Q I YQELTDL KTAFERLCRHC PKDWT F FQGNC YFMSNSQRNWHDS VTACQEVRAQL W I KT 2 34 
+ IYQELT LK A ERLC CP + WTFFQGNC YFMSNSQRNWHDS + TAC -+ EV AQLWIK + 
Y.l\ UELTC'LP'AAVERLCHPCPWEWTFFQGNCYFMSNSQRNIVHDSITACK EVGAQLWIKS 2 9 6 

AEEQIIFLQLQTSRSNRFSWT'GLSDLNQEGTWOVA^ 2 94 



Query : 


1 


Sbjct : 


1 


Query : 


41 


Sbjct : 


57 


Query : 


101 


Sbjct : 


117 


Query : 


1 i 5 


Sbjct : 


177 


Query : 


17 5 


Sbl c t : 


23 7 


Query : 


235 
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BLAST2 Results 



http://patents.incy tc.com:8000/c2i-bin/ScqScr\cr/ScqScr\ i 



AEFSG + GWND + + C + + +WICKK AA C RDE 
Sbjct: 357 AEFSGNGWNDDKCNLAKFWICKKSAASCSRDE 388 



>g!7 04 9084 unnamed protein product [Homo sapiens] 
Length - 4 04 

Score - 532 bits (1356), Expect = e-150 

Identities - 281/392 (71%), Positives - 301/392 (76%), Gaps - 71/392 (18%) 

MSDSKEPRVQQLGLL GCLGHGALVLQLLSFMLLAGVLVAI 4 0 

MSDSKEPR+QQLGLL GCLGHG LVLQLLSF LLAG + 
MSDSKEPRLQQLGLLEEEQLRGLGFRQTRGYKSLAGCLGHGPLVLQLLSFTLLAG L 56 

LVQVSKVPSSLSQEQSEQDAIYQNLTQLKAAVGELSEKSKLQEIYQELTC'LKwAAVGELPE 100 
LVQVSKVPSS+SQEQS QDAIYQNLTQLKAAVGELSEKSKLQEIYQELTQLKAAVGELPE 



KSKL QEI YQELTRL 



KAAVGELPEKSK QEI YQELTRLKAAVGELPEKSK QEI YQELTRLKAAVGELP++SKQQ 



+IYQELT LK A ERLC CP +WTFFQGNCYFMSNSQRNWHDS+TAC+EV AQLWIK+ 



AEEQNFLQLQ+SRSNRF+WMGLSDLNQEGTWQWVDGSPL PSF++YWN GEPNN G EDC 



Query : 


1 


Sbjct : 


1 


Query : 


41 


Sbjct : 


57 


Query : 


101 


Sbjct : 


117 


Query : 


115 


Sbjct : 


177 


Quex y : 


1 / D 


Sbjct : 


237 


Query : 


235 


Sbjct : 


297 


Query : 


295 


Sbjct : 


357 



AEFSG+GWND++C++ +WICKK AA C RDE 



> g!5281073 mDC-SIGNIA type I isoform [Homo sapiens] 
Length = 404 

Score - 532 bits (1356), Expect = e-150 

Identities - 281/392 (71%), Positives - 301/392 (76%), Gaps - 71/392 (18%) 

MSDSKEPRVQQLGLL GCLGHGALVLQLLSFMLLAGVLVAI 40 

MSDSKEPR+QQLGLL GCLGHG LVLQLLSF LLAG + 
MSDSKEPRLQQLGLLEEEQLRGLGFRQTRGYKSLAGCLGHGPLVLQLLSFTLLAG L 56 

LVQVSKVPSSLSQEQSEQDAI Y'jNLTQLKAAVGEIjSEKSKLQEIYQELTQLKAAV''GELPE 10 0 
LVQVSKVPSS+SQEQS QDAI YQNLTQLKAAVGELSEKSKLQEI YQELTQLKAAVGELPE 



KSKL QEI YQELTRL 



KAAVGELPEKSK QEI YQELTKLKAAVGELPEKSK QEI YQELTRLKAAVGELP+ + SEQQ 

v G E L P F K S K O O F T Y Q F L T R T , T K A A' /G F L P F K S K Q Q E T Y Q E L T F I j K A A VG E L P E K S K d d 2 3 6 



Query : 


1 1 
] 


Sbjct : 


1 ] 


Query : 


41 ] 

1 


Sbjct : 


57 ] 


Query : 


101 ] 


Sbjct : 


117 ] 


Query : 
ck;,.» . 


115 ] 
I 
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BLAST2 Results 



http://palents.incyte.com:8000/cgi-bin/ScqScrver/ScqScrv. 



Query: 235 AEEQNFLQLQTSRSNRFSVmGLSDLNQEGTWQWDGSPLSPSFQRYWNSGEPNNSGNEDC 2 94 

AEEQNFLQLQ+SRSNRF+WMGLSDLNQEGTWQWVDGSPL PSF++YWN GEPNN G EDC 
Sbjct: 297 AEEQNFLQLQSSRSNRFTWMGLSDLNQEGTWQWDGSPLLPSFKQ^i^WNRGEPNTJVGEEDC 356 

Query: 2 95 AEFSGSGWNDNRCDVDNYWICUKPAA-CFRDE 32 5 

AEFSG+GWND++C++ +WICKK AA C RDE 
Sbjct: 357 AEFSGNGWNDDKCNLAKFWICKKSAASCSRDE 388 



> g!338 3468 DC-SIGN [Homo sapiens] 
Length = 404 

Score - 532 bits (1356), Expect = e-150 

Identities = 281/392 (71%), Positives -- 301/392 (76%), Gaps = 71 /392 (18%) 

Query: 1 MSDSKEPRVQQLGLL GCLGHGALVLQLLSFMLLAGVLVAI 4 0 

MSDSKEPR+QQLGLL GCLGHG LVLQLLSF LLAG + 
Sbjct : 1 MSDSKEPRLQQLGLLEEEQLRGLGFRQTRGYKSLAGCLGHGPLVLQLLSFTLLAG L 5 6 

Query: 41 LVQVSKVPSSLSQEQSEQDAIYQNLTQLP^AVGELSEKSKLQEIYQELTQLf^AVGELPE 100 

LVQVSKVPSS + SQEQS QDAIYQNLTQLKJVAVGELSEKSKLQEIYQELTQL?;AAVGELPE 
Sbjct: 57 LVQVSKVPSSISQEQSRQDAIYQNLTQLflAAVGELSEKSfvLQEIYQELTQLKAAVGELPE 116 

Query: 101 KSKL QEIYQELTRL 114 

KSKL QEIYQELTRL 
Sbjct : 117 P:SKLQEIYQFXTRLKAAVGELPEKSKLQEIYQELTWLKJVAVGELPEKS^QEIYQELTRL 176 

Query: 115 JCAAVGELPEKSf'LQEIYQELTRLfLAAVGELPEKSKLQEIYQELTRLKAAVGELPDQSKQQ 174 

KAAVGELPEKSK QEIYQELTRLKAAVGELPEKSK QEIYQELTRLKAAVGELP++SKQQ 
Sbjct: 177 PIAAVGELPEKSKQQEIYQELTRLP^AAVGELPEKSKQQEIYQELTRLKAAVGELPEKSfCQQ 236 

Query: 17 5 QIYQELTDLKTAFERLCRHCPKDWTFFQGNCYFMSNSQRNWHDSVTACQEVRAQLWIKT 2 34 

+ IYQELT LK A ERLC CP +WTFFQGNCYFMSNSQRNWHDS+TAC+EV AQLWIK+ 
Sbjct: 237 EI YQELTQLFJVAVERLCHPCPWEWTFFQGNCYFMSNSQRNWHDSITACKEVGAQLWIKS 296 

Query: 2 35 AEEQNFLQLQTSRSNRFSWMGLSDLNQEGTWQWVDGSPLSPSFQR YWNSGEPNNSGNEDC 2 94 

AEEQNFLQLQ+SRSNRF+WMGLSDLNQEGTWQWVDGSPL PSF++YWN GEPNN G EDC 
Sbjct: 297 AEEQNFLQLQSSRSNRFTWMGLSDLNQEGTWQWVDGSPLLPSFKQY'WNRGEPNNVGEEDC 356 

Query: 295 AEFSGSGWNDNRCDVDNYWICKKPAA-CFRDE 325 

AEFSG+GWND-+C++ +WICKK AA C RDE 
Sbjct: 357 AEFSGNGWNDDKCNLAKFWICKKSAASCSRDE 388 



> g!01796 10 probable mannose-binding C-type lectin DC-SIGN [Homo 
sapiens ] 
Length = 404 

Score = 532 bits (1356), Expect - e-150 

Identities = 28 1/392 (71%), Positives ---- 301/392 (76%), Gaps = 71 /392 (18%) 

Query: 1 MSDSKEPRVQQLGLL GCLGHGALVLQLLSFMLLAGVLVAI 4 0 

MSDSKEPRVQQLGLL GCLGHG LVLQLLSF LLAG + 

sbjct : i msdskeprlqqlglleeeqlrgix;frqtrgykslagclghgplvlqllsftllag L 56 

Query: 41 LVQVS?^VPSSLSQEQSEQDAIYC.»NLTQLfLAAVGELSEKSKLQEI YQELTQLPLAAVGELPE 100 

LVQVSKVPSS+SQEQS QDAI YQNLTQLKAAVCELSEKSfCLQEI YQELTQLKAAVGELPE 
Sbjct : 57 L V QV S KVP S S I S Q E Q S RQDA I Y QN LTQL KJ\AVG E L S E KS KL Q E I YQ ELTQL KAAYGE L P E 116 

OllPrV 101 VQVT ^T--nr.-,^T mr.T 

^-^j.^^^o*^ I ^'tLUl KIj 114 

KSKL npi Vf>pr .tdt. 
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BLAST2 Results 



http://patents.incytc.com:8000/cgi-bin/SeqScrver/SeqScrvi 



Sbjct : 


17 7 


KAAVGELPEKSKQQEI YQELTRLKAAVGELPEKSKQQEIYQELTRLKAAVGELPEKSKQQ 


236 


Query : 


175 


qiyqeltdlp:taferlcrhcpkdwtffqgncyfmsnsqrnwhdsvtacqevraqlwikt 


234 






+IYQELT LK A ERLC CP +WTFFQGNCYFMSNSQRNWHDS+TAC+EV AQLWIPC + 




Sbjct : 


23 7 


EIYQELTQLKJVAVERLCHPCPWEWTFFQGNCYFMSNSQRNWHDSITACKEVGAQLWIKS 


296 


Query : 


235 


AEEQNFLQLQTSRSNRFSWMGLSDLNQEGTWQWVDGSPLSPSFQRYWNSGEPNNSGNEDC 


294 






AEEQNFLQLQ+SRSNRF+WMGLSDLNQEGTWQWVDGSPL PSF++YWN GEPNN G EDC 




Sbjct : 


297 


AEEQNFLQLQSSRSNRFTWMGLSDLNQEGTWQWVDGSPLLPSFKQYWNRGEPNNVGEEDC 


356 


Query : 


295 


AEFSGSGWNDNRCDVDNYWICKKPAA-CFRDE 32 5 








AEFSG + GWND + + C + + +WICKK AA C RDE 




Sbjct : 


357 


AEFSGNGWNDDKCNLAKFWICKKSAASCSRDE 3 88 





Database: genpeptl31 

Posted date: Sep 9, 2002 12:31 PM 
Number of letters in database: 348,344,575 
Number of sequences in database: 1,135,942 

Lambda K H 

0.315 0.130 0.384 

Gapped 

Lambda K H 

0.270 0.0470 0.230 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 213284592 

Number of Sequences: 1135942 

Number of extensions: 8629475 

Number of successful extensions: 36326 

Number of sequences better than 10.0: 1471 

Number of HSP's better than 10.0 without gapping: 638 

Number of HSP's successfully gapped in prelim test: 888 

Number of HSP's that attempted gapping in prelim test: 32786 

Number of HSP's gapped (non-prelim): 2270 

length of query: 325 

length of database: 348,344,575 

effective HSP length: 59 

effective length of query: 266 

effective length of database: 281,323,997 

effective search space: 74832183202 

effective search space used: 74832183202 

T: 11 

A: 40 

XI: 16 { 7.3 bits) 
X2: 3 8 (14.8 bits) 
X3 : 64 (24.9 bits) 
SI: 41 (21.5 bits) 
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EXHIBIT C 



hmmpfam - search a single seq against HMM database 
HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) 



HMM file: 
Sequence file: 



/data/isb2k/blastdb/Pfam7 2/Pfam7 2 
/u/ legal/ jennyb/pf 63 6 . seq 



Query: 3344986CD1 



Scores for sequence family classification (score includes all domains) 



Model 



Description 



lectin_c Lectin C-type domain 

Ribosomal__L29 Ribosomal L29 protein 



Score 

139 . 5 
-15.3 



E-value N 



5 .9e-38 
9.1 



Parsed for domains: 

Model Domain seq-f seq-t 



Ribosomal_L29 1/1 85 152 . 

lectin_c 1/1 211 317 . 



hmm-f hmm-t 

1 64 [] 
1 125 [] 



score E-value 

-15.3 9.1 
139.5 5.9e-38 



Alignments of top-scoring domains: 

Ribosomal_L29 : domain 1 of 1, from 85 to 152: score -15.3, E = 9.1 

*->akELRelsde . .EL. . eeeleelKrELf eLRAf qaAtGqLenPhrlk 
++EL +1+ + +EL+++ +1 e + +EL L+ aA+G+L +++ 
3 344 9 86CD1 85 YQELTQLKAAvgELpeKSKLQEIYQELTRLK AAVGELPEKSKLQ 12 8 



3344986CD1 



evRkrIARilTv. . . lnErklsae<- 
e+ +++ R++ + ++1 E+ + +e 
129 EIYQELTRLKAAvgeLPEKSKLQE 



151 



lectin_c: domain 1 of 1, from 211 to 317: score 139.5, E - 5.9e-38 

*->esktWaeAelaCqkegghAHLvsIqsaeEqsfwaf ltsltkksnty 
++++W+++ + aCq+ ++ Lv+I aeEq + fl + t + + sn 
3344986CD1 211 SQRNWHDSVTACQEVRAQ- -LWIKTAEEQ NFLQLQTSRSNRF 251 



3344986CD1 



aWIGLtdintegtwvwegwetdgspvnyt . . enWapgePnnrgnhGgnEd 
W+GL+d n+egtw+-w +dgsp++ + + + +W++gePnn gn Ed 
>52 SWMGLSDLNQEGTWQW VDGSPLSPSf qRYWNSGEPNNSGN ED 2 93 



I M4 9 86CD1 



Cveiy tdtdf laGkWnDepCdsklpy^Cef < - 
(>e+ + +f V-/nD + Cd+ + + +C + + 

2 94 CAEFSGS GWNDNRCDVDNYWTCKK 



BLIMPS (BLocks IMProved Searcher) Version 3.4 2000/06 

(C) Copyright 1993-2000, Fred Hutchinson Cancer Research Center 



Probe Sequence: 3344986CD1 
Probe Size: 325 Amino Acids 

Probe File: /u/ legal / jennyb/WIP . 2 3834 /3 344986CD1 . fast a 

Target File (s) : / opt /pat /prod/SOL/bl imps-3 . 4 /dat/blocks . dat 

Records Searched: 4071 

Scores Done: 4071 

Alignments Done: 1439864 

AC# Description Strength Score 

RF AA# 

IPB001304A C-type lectin domain 1072 1164 

0 197 WTFFQGNCYFMSNSQRNWHDSVTAC 

IPB001304B C-type lectin domain 1000 1163 

0 2 52 WMGLS DLNQEGTW 



MOTIFS from: / u/ legal / j ennyb/WIP . 237 23 / 3 344986CD1 . gcg 
Mismatches: 0 January 30, 2003 14:55 

3344986CD1 . gcg Check: 7873 Length: 325 ! >3344986CD1 



C_Type_Lec tin_l 

C(L,I,V,M,F,Y,A,T,G)x{5,12} (W,L)x(D,N, S , R ) x2Cx { 5 ,6} (F,Y,W,L,I,V,S,T,A) ( L , I, V, 
( S,T,A)C 

C(A)x{6}(W)x(D)x{2}Cx{5}(W) (I)C 
2 94: SGNED CAE F S G S GWNDNRC D VDNYW I C KKPAA 
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Parallel human genome analysis: Microarray-based expression 
monitoring of 1000 genes 

(Human Genome Project/DNA chip/gene discovery/T cell) 

Mark Schena't, Dar! Shalom*, Renu Heller', Andrew Qui', p a ™ck O. BrownS, and Ronald W. Dav IS - 
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Contributed by Ronald W. Davis, June 26, 1996 



ABSTRACT Microarrays containing 1046 human cDNAs 
of unknown sequence were printed on glass with high-speed 
robotics. These 1.0-cm* DNA "chips" were used to quantita- 
tively monitor differentia) expression of the cognate human 
genes using a highly sensitive two-color hybridization assay. 
Array elements that displayed differential expression patterns 
under given experimental conditions were characterized by 
sequencing. The identification of known and novel heat shock 
and phorbol ester-regulated genes in human T cells demon- 
strates the sensitivity of the assay. Parallel gene analysis with 
microarrays provides a rapid and efficient method for laree- 
scaie human gene discovery. 

Biology has entered the genome era (1). Complete genome 
sequences for all of the model organisms and human will 
probably be available by the year 2003 (2). Torrents of human 
expressed sequence tags (ESTs) provide a starting point for 
elucidating the function of tens of thousands of cognate genes 
(3). Genome analysis will provide insights into growth, devel- 
opment, differentiation, homeostasis, aging, and the onset of 
diseases (1-3). A detailed understanding of the human genome 
will require the implementation of sophisticated methods for 
gene expression analysis and gene discovery. 

Recently, a microarray-based method for high- throughput 
monitoring of plant gene expression was described (4). This 
"chip"-based approach involved using microarrays of cDNA 
clones as gene-specific hybridization targets to quantitatively 
measure expression of the corresponding plant genes (4, 5). A 
two-color fluorescence labeling and detection scheme 'facili- 
tated sensitive differeotiaJ expression analysis of different 
plant tissues (4, 5). The efficiency of this approach for studies 
m higher plants suggested the use of this method for human 
genome analysis (4-7). Here, we report the use of cDNA 
microarrays for human gene expression monitoring, biological 
investigation, and gene discovery. 

MATERIALS AND METHODS 
Human cDNA Clones. The cDNA library was made with 
mRNA from human peripheral blood lymphocytes trans- 
formed with the Epstein-Barr virus. Inserts > 600 bp were 
cloned into the lambda vector AYES-R to generate 10-10 8 
recombinants. Bacterial transformants were obtained by in- 
fecting £. coli strain JM107/XKC. Colonies were picked at 
random and propagated in a 96-well format, and minilysaie 
DNA was prepared by aUcaline Ivsis using REAL preps 
(Oiagen, Chatswortb, CA). inserts were amplified by PCR in 
a %-well format using primers (PAN132 5'-CCTC- 
TATACTTTAACGTCAAGG; and PAN 133. 5 f -TTGTGTG- 
GAATTGTGAGCGG) complementary ic ihc AYES 



(Glen Research, Sterling, VA) on the 5' end. PCR products 
were purified in a 96-well format using QlAquick columns 
(Oiagen). 

Microarray Preparation. Amino-modified PCR products 
were suspended at a concentration of 0.5 mg/ml in 3x 
standard saline citrate (SSC) and arrayed from 96-well micro- 
liter plates onto sUylated microscope slides (CEL Associates 
Houston) using high-speed robotics (4-7). A total of 1056 
cDNAs, representing 1046 human clones and 10 Arabidopsu 
controls, were arrayed in 1.0-cm 2 areas. Printed arrays were 
mcuuatcu ior ^ nr in a humid chamber to allow rehydration of 
the array elements and rinsed, once in 0.2% SDS*for 1 min 
twice in H 2 0 for 1 min, and once for 5 min in sodium 
borohydride solution (1.0 g of NaBH4 dissolved in 300 ml of 
PBS and 100 ml of 100% ethanol). The arrays were submerged 
in H 2 0 for 2 min at 95°Q transferred quic'klv into 0.2% SDS 
for 1 mm, rinsed twice in H 2 0. air dried, and stored in the dark 
at 25°C. 

jnjJS~ nl Pr0bcS * Tlssuc mRNAs w <=re purchased 
(CLONTECH). Jurkat mRNA was isolated as described by 
Schena et al (4). Probes were made as described (4) with 
several modifications. The reverse transcriptase used here was 
Superscript II RNase H- (G1BCO). The Cy5-dCTP was 
purchased from Amersham. Each reverse transcription reac- 
tion contained 3.0 M g of total human mRNA. Arabidopsis 
control mRNAs were made by in vitro transcription of cloned 
HAT4, HAT22, and YesAt-23 cDNAs (4, 8, 9) using an RNA 
Transcription Kit (Stratagene). For quantitation, the mRNAs 
were doped into the reverse transcription reaction at ratios of 
1:100,000, 1:10,000, and 1:1000 (wt/wt) respectively. Following 
the reverse transcription step, samples were treated with 2.5 /il 
of 1 M sodium hydroxide for 10 min at 37°C, then neutralized 
by adding 2.5 M l of 1 M Tris-HCl (pH 6.8) and 2.0 M I of 1 M 
HCI. Probe mixtures contained cDNA products derived from 
3 Mg of total mRNA, suspended in 5.0 pd of hybridization 
buffer (5x SSC plus 0.2% SDS). 

Hybridization and Scanning. Probes were hybridized to 
1.0-cm microarrays under a 14 X 14 mm glass coverslip for 
6-12 hr at 60°C in a custom-built hybridization chamber (4-7). 
Arrays were washed for 5 min at room temperature (25°C) in 
low stringency wash buffer (1 X SSC/0.2% SDS), then for 10 
min at room temperature in high stringency wash buffer (0.1 X 
SSC/0.2% SDS). Arrays were scanned in 0.1 x SSC using a 
fluorescence laser scanning device (4-7), fitted with a custom 
filter set (Chroma Technology, Brattleboro, VT). Accurate 
differentia] expression measurements (i.e., final fluorescence 
ratios) were obtained by taking the average of the ratios of two 
independent hybridisations. 
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Biochemistry: Schena et al. 

Cell Culture. Jurkat cells were grown in a tissue culture 
incubator (37°C and 5% CO : ) in RPMI medium supplemented 
with 10% fetal bovine serum, 100 of streptomycin per mi 
and 500 units of penicillin per ml. Heat shock corresponded to 
a 4-hr incubation at 43°C. Phorbo) ester treated ceils were 
grown for 4 hr in the presence of 50 ng of phorboi 12-myristate 
B-acetate (PMA) per ml. 
RNA Blotting. Dot blots were performed as described (4). 
DNA Sequencing. Sequences were obtained using the 
PAN132 and PAN133 primers and a 373A automated se- 
quencer, according to the instructions of the manufacturer 
(Applied Biosystems). 

Computer Graphics and Informatics. Pseudocolor represen- 
tations of fluorescent images were made with National Institutes 
of Health image software (version 1.52). Software for differential 
expression representations was purchased from Imaging Re- 
search (St. Catherine's, ON, Canada). Sequence searches were 
made to the nonredundant nucleotide data base at the National 
Center for Biotechnology Information (NCBI) using Macintosh 
blast software. The EST data base was accessed via the World 
Wide Web (http:/www.ncbi.nlm.nih.gov/). 

RESULTS 

Gene Discovery and the Heat Shock Response. Microarravs 
were usea to examine the heat shock response in cultured 

"!? T (Jurkat) cells. Control (37*C) and heat-treated 
(43 C) cells were harvested and rysed, and total mRNA from 
the two cell samples was labeled by reverse transcriptase 
incorporation of fluorescein- and Cy5-dCTP, respectively In 
a second set of labeling reactions, the fluorescent groups were 

swapped" such that samples from control and heat-treated 



A. 



-Heat Shock 
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samples were labeled with Cy5- and fluorescem-dCTP. respec- 
«ve£. Each pair of fluorescent probes wa, hybridized to a 
1056-element microarray. He arrays were washed a. h.ch 
stringency and scanned with a confocal laser scannins device 
to detect emission of the two fluorescent eroups. * 

Hybridization signals were observed to >95% of the human 
cDNA array elements, but not to any of the Arabidopsis 
negatrve controls (Fig. 1). Fluorescence intensit.es spanned 
more than three orders of magnitude for the 1046 array 
elements surveyed (Fig. 1). Comparative expression analysis of 
heat shocked versus control cells in the two experiments 
revealed 17 array elements that d.splayed altered fluorescence 
ra ,os of js 2 .0-fold (Figs. 1 and 2A). Of the 17 putative 
different.ally expressed genes. 11 were induced by heat shock 
treatment and 6 displayed modest repression (Figs. 1 and 2A) 
To determine the identity of the heat-re e ulated eenes 
cDNAs corresponding to each of the 17 array elements were 
sequenced on the proximal and distal end. Data base searches 
revealed perfect matches for 14 of the 17 clones, and in each 

^ne P ^ X h? a if 1??' CDNA se 9 uen * s ™PI*d to the same 
gene (Table 1). Of the 1046 human genes examined on the 
microarray the five most highly induced in heat-treated cells 
were heat shock protein 90 Q (hs P 90a), dnaJ. hs P 90f3, polyu- 
5f1™ n , n 7 " t " C ° I ? Pl " P^W^-l ttcp-1) (Table 1). Three 

thZIC '~"""7.u " maiLh My en,r y ln ,ne P ublic data base. 

hough one of the clones (B7) exhibited significant homology 
to an EST from CaenorhabdUis elegans (Table 1). Each of the 
novel sequences (B7-B9) exhibited -2-fold induction (Tab e l 
and relatively low-level expression (Table 2). 
To confirm the microarray results, mRNA levels for each of 

he genes were measured by RNA blotting. Each of the genes 
that displayed heat shock induction, including the three novel 



B.l 



+Heat Shock 



I-";- . : — • • -^^*»^ W.: ..: f.< 

' v; <' ••••• 9 ■• v 0 ,. " [ ' ] 



: * 

'■ « • > ■" v. ., 



10 """"" 



Expression Level (per 100,000) 



FjG l ; ^^^^^^^nitoredonamicroarrav . nuorescxnt ^ru represent^ ,n 
"" - ; "tn>is 'upper trf ...mr- -icment^ . 4, r , 

' atCw in separate cxpenmcnii uimc itnown nuanitit^ i ^ t » rt * ^ ^ -»-.»se^ ,^rccn buxcs i m nc^ii sdik t 

l^t! and column, Ia i th. „^-, ... ! ,^ UanUUcs < * l / wl > oi Arabidopsu control mRNAs added tn th, i=.k^i — 

- ■ r — — "»^«" »i element increments (while circles). (Bar = 1 mm . } " r- - • 



1 ! j c 



i ne color bar was 



10616 Biochemistry: Schena et al 

-/+ Heat Shock 



Proc. Nad. Acad. Sci. U14 93 (1996) 

-/+ Phorbol Ester 




^^iS« 0.5- 2.0 



Expression Ratios 

Fig. 2. Elemental displays of activated and repressed ecucs Flunrru-rnrr r^n™ „r „ 
scbemadcally. Fluorescein-labeled probes from Jurkat^lU Eted ™U??JT^v L^T^ 0m ' y (Fig ' l) ™ 

Cy5-labeled probes from untreated cells. In a second set of reanions hefTu^ ph °^? 1 Mler tJeatment » ere "«*»«<» 

of the ratios from two hvbridizations. excluding values in whichT' dHfc rnr!^,f i P"" 1 (S " teX,) ' The data "P'«e'» the average 
bar corresponds to express™ ra.os! «^ ^ i 2SSS^t1S^i^^^Jr^ , ^ ^ ^ ^ ra "°- 11,6 

Table 1. Microarray elements corresponding to differe ntially expressed genes 
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Table 2. Human gene expression monitored by microarray and RNA blot analyses 
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sequences, exhibited elevated mRNA levels by dot blot analysis 
(Table 2). In all cases, expression ratios as determined by the 
two procedures differed by <2-fold for the genes identified in 
the heat shock experiments (Table 2). The two assays differed 
more widely in terms of assessing absolute expression ievels- 
nonetheless, absolute expression as monitored on a microarray 
typically correlated with RNA blots to within a factor of five 
(Table 2). 

Phorbo! Ester Signaling. To explore a signaling pathway 
distinct from the heat shock response, microarrays were used 
to examine the cellular effects of phorbol ester treatment 
Jurkat cells were treated with phorbol ester, harvested lysed 
and used as a source of mRNA. Samples of mRNA from 
untreated or phorbol ester-stimulated cells were labeled with 
reverse transcriptase. The probes were mixed, hybridized to 
microarrays, and scanned for fluorescence emission of the two 
fluorescent groups. A total of six array elements displayed 
^2.0-fold elevated signals with probes from phorbol ester- 
treated cells relative to control samples (Fig. 2B). 

To determine the identity of the phorbol ester-induced 
genes, clones corresponding to the six array elements were 
sequenced. Data base searches revealed perfect matches for 
five of the six sequences (Table 1). The two most highly 
induced genes were the PAC-1 tyrosine phosphatase and 
nuclear factor-kappa Bl (NF-kB1); modest activation was 
?rl7 ed 5? P hos P no ^y ce rate kinase and ^microglobulin 
( iable 1). One remaining clone (B19) did not match any entry 
in the public data base (Table 1). B19 displayed a 2 1-fold 
induction and, similar to the novel heat shock genes a rela- 
tively low absolute expression level (Tables 1 and 2) All six of 
the phorbol ester-inducible genes displayed increased steady- 
state mRNA levels by RNA blotting (Table 2). PAC-1 expre<- 
<!on (Fie. 1: Tabic ?\ defined .* i;^:. < 1 rr ,r rj - 



row w ^ n :^ stale ' and hearT by Iabeiin * each mRNA sa *Ple 

with CyS-dCTP. In a separate reaction, a control probe was 
prepared by labeling Jurkat mRNA with fluorescein-dCTP 
The Tour Cy5-labeled probes were each mixed with an aliquot 
of the fluorescein-labeled control sample, and the four mix- 
tures were hybridized to separate microarrays. The arrays were 
washed and scanned for fluorescence emission, and hybrid- 
ization signals for each of the tissues samples were normalized 
to the Jurkat control to generate an expression profile for each 
of the 1046 clones present on the array. 

Detectable expression was observed for all 15 of the heat 
shock and phorbol ester-regulated genes in the four tissue 
types examined (Fig. 3). In general, the expression level of each 
gene in Jurkat cells correlated rather closely with expression in 
the four tissues (Table 2; Fig. 3). Genes encoding 0-actin and 
cytochrome c oxidase, the two most highly expressed of the 15 
genes in Jurkat cells (Table 2), were highly expressed in bone 
marrow, bram, prostate, and heart (Fig. 14). Expression of 
cytochrome c oxidase, hsp90 Q , and the novel B7 sequence was 
significantly greater in heart than in the other tissues (Fig. 3). 

DISCUSSION 

Many of the heat shock genes identified in this study encode 
factors that function either as molecular "chaperones" 
(HSP90a. HSP9O0, DnaJ, TCP-1) or as mediators of protein 
degradation (polyubiquitin). The identification of these se- 
quences is consistent with the biochemical basis of heat shock 
induction (10-15). Proteins undergo denaturation at elevated 
temperatures, and those that fail to maintain proper confor- 
mation must be selectively degraded (10-15). It will be inter- 
ring to determine whether the rhree novei heat shork- 



juman tissues, probes were prepared h--- 



xpreisiuii 



monitoring, ana Oiocnemicai anaivsis should pro- 
a ucidiitu iuuLuonai understanding of these genes. 
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Phorbol ester ,a potent activator of protein kinase C( 16 17) 
lit? l M 0f ni en " distinet £rom ,hose involv ^ 'he heat 

tudt E.rT ^ m ° St h,gbiy iDduced * ene identified «S 
study. PAC-1, encodes a nuclear tyrosine kinase that mav plav 

M8? T 8 lra T scri P ti0 " a " d cell cycle profession 
(18). NF-kB1, a second phorbol ester-inducible gene is an 
intensely studied member of the Rel transcription factor 
fam>ly (19-21). The Rel proteins are activated bv a Targe 
number of stimuli, including phorbol esters, cytokines, bacte- 
rial and viral pathogens, and ultraviolet light (19-21) Modest 
actuation was observed for three sequences not known °o be 
inducible by phorbol esters, including phosphogivcerate ki- 
nase, ^-microglobulin, and a novel human gene (B19) Ex- 

undTru!n P H re$Si r m0ni, ° rin S wi, h microarrays should assist in 
understanding how each of these genes integrate into the 
highly complex phorbol ester signaling pathway- 
It is sinking that four novel human genes were discovered 
w. h an array of 1000 randomly chosen clones. par, cX, v 

vl been I ShOCk K aDd Ph0rb °' " ter pathway 
have been so mtens.very studied (10-21). The facile discovery 

be used' fo? UenC8i ," nderScores the l*« '^t microarrays can 
be used for gene discovery in the absence of any sequence 

from any Lbrary of interest and only those clones that display 
,°^ ° g el P« ss,on are sequenced and character- 

«say. cuupied with a modest DNA sequenc- 

SSSSfiS 5825*" "-— — 

Genes that are activated or repressed by a given stimulus 
provide functional clues to the cellular pathway S3 
(22-24). Detailed examination of these gene expression "sig- 
natures can provide a dynamic view of the mode of action of 
a given signaling substance (22-24). Microarrays may thus 
a low rapid mechanistic examination of hormones drugs 
elicitors. and other small molecules; moreover, functional 
analysis of transcription factors, kinases, growth factors, cyto- 
kines receptors, and other gene products should be possible 

Ts Z\TJ u y IO deve '° p mRNA a *P'ifi«tion strata 
gies to enable probe preparation from minute tissue samples 
This capabili ty might allow for high-throughput patient screen- 
ing in a clinical setting. 

The current detection limit of the assav allows monitoring of 

mRNA Th,', , a L re ,r Sen ' ~ 1:500 - 000 Vt/w.) of the total 
, Tn,s 10 " fold increase in sensitivity compared with the 
original report (4) was achieved largely by modifying the 
coupling chemistry, which reduced background fluorescence 
I he significance of this improvement is considerable in that 
approximately half the human genes identified ,n this study 

WowZ ,T e ' 5eqUe " CeS - exhibited "P ressi °" leveU 

below the onginal detection limit of 1:50.000 (4) 

ailvJ h'",l '° de 'f Ct 2 - f °' d Chan * es in expression was 
achieved by the use of two<olor fluorescence in the labeling 
and detection schemes, digitized data collection, and custom 
h.fT; k ,m P° r,ance of 'his capability is underscored by 
■ he fact that nearly all of the genes examined here exhibited 

showed <iTJ" eXpreSS,on The , four genes, which 
showed s 2.2-fold activation, were probably overlooked in 
previous screens that used conventional differential expression 
techmoues I. may be possible to further improve the precision 
of the m.croarray assay by the use of closely related fluorescent 
analogs, such as Cy3 and Cy5. in the labeling and hybridization 
reactions. 

Microarrays offer a number of advantages over other po- 
"Ht hl 5 h - ca P aclt >' approaches to expression analysis. The 

wi.p-D35Cu 3pDTuH t h pn shjpt SIT! 3 I ! h Vh Ti C*i i i 7 1 f i \i n i ,,.,! nmo , u ' u 



coruintt to expression levels 
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• — :ii \Peuhenv mat is not 



readily attained with ohgo 
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nucleotide arrays (27-30). The parallel format of the assay 
provides a simultaneous differential expression readout for 
>1000 genes. This contrasts with sequencing-based methods, 
which require serial data collection for expression analysis (3l! 
32). A commercial source of cDNA microarrays would greatly 
speed the use of a chip-based approach to expression analysis. 

The availability of large numbers of ESTs (3) provides a rich 
resource of human cDNA clones for micro arraying. The 
>400,000 ESTs in the public data bases represent a significant 
subset of all human genes (3, 33). Microarrays of thousands of 
ESTs will provide a powerful analytical tool for future human 
gene expression studies. The -100,000 genes in the human 
genome (2, 33) emphasize the need for microarrays of greater 
density. Attempts to improve microdeposition techniques are 
underway and should allow construction of arrays containing 
a complete set of human gene targets (http://cmgm.st anford 
edu/-schena/). Microarrays of -100,000 cDNA elements 
would allow expression monitoring of the entire human ge- 
nome in a single hybridization. This capacity, coupled with 
detailed biochemical analysis of the individual gene products, 
would greatly speed the functional analysis of the human 
genome. 
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Pield of the Invention 

5 This invention relates to a method and apparatus 

for fabricating microarrays of biological samples for 
large scale screening assays, such as arrays of DNA 
samples to be used in DNA hybridization assays for 
genetic research and diagnostic applications • 
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Background of the Invention 

A variety of methods are currently available for 
making arrays of biological macromolecules , such as 

10 arrays of nucleic acid molecules or proteins. One 
method for making ordered arrays of DNA on a porous 
membrane is a "dot blot" approach. In this method, a 
vacuum manifold transfers a plurality. e-gN, 96, 
aqueous samples of DNA from 3 millimeter diameter wells 

15 to a porous membrane. A common variant of this 

procedure is a "slot-blot" method in which the wells 
have highly-elongated oval shapes. 

The DNA is immobilized on the porous membrane by 
baking the membrane or exposing it to UV radiation. 

2 0 This is a manual procedure practical for making one 

array at a time and usually limited to 96 samples per 
array. "Dot-blot" procedures are therefore inadequate 
for applications in which many thousand samples must be 
determined . 

25 A more efficient technique employed for making 

ordered arrays of genomic fragments uses an array of 
pins dipped into the wells, e.g., the 96 wells of a 
microtitre plate, for transferring an array of samples 
to a substrate, such as a porous membrane. One array 

30 includes pins that are designed to spot a membrane in a 
staggered fashion, for creating an array of 9216 spots 
in a 22 x 22 cm area (Lehrach, et al . , 1990) . A 
limitation with this approach is that the volume of DNA 
spotted in each pixel of each array is highly variable. 
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In addition, the number of arrays that can be made with 
each dipping is usually quite small. 

An alternate method of creating ordered arrays of 
nucleic acid sequences is described by Pirrung, et al. 
5 (1992), and also by Fodor, et al. (1991). The method 
involves synthesizing different nucleic acid sequences 
at different discrete regions of a support. This 
method employs elaborate synthetic schemes, and is 
generally limited to relatively short nucleic acid 

10 sample, e.g., less than 20 bases. A related method has 
been described by Southern, et al. (1992). 

Khrapko, et al. (1991) describes a method of 
making an oligonucleotide matrix by spotting DNA onto a 
thin layer of polyacrylamide. The spotting is done 

15 manually with a micropipette. 

None of the methods or devices described in the 
prior art are designed for mass fabrication of 
microarrays characterized by (i) a large number of 
micro-sized assay regions separated by a distance of 

20 50-200 microns or less, and (ii) a well-defined amount, 
typically in the picomole range, of analyte associated 
with each region of the array. 

Furthermore, current technology is directed at 
performing such assays one at a time to a single array 

25 of DNA molecules. For example, the most common method 
for performing DNA hybridizations to arrays spotted 
onto porous membrane involves sealing the membrane in a 
plastic bag (Maniatas, et al., 1989) or a rotating 
glass cylinder (Robbins Scientific) with the labeled 

3 0 hybridization probe inside the sealed chamber. For 
arrays made on non-porous surfaces, such as a : 
microscope slide, each array is incubated with the 
labeled hybridization probe sealed under a coverslip. 
These techniques require a separate sealed chamber for 
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each array which makes the screening and handling of 
»any such arrays inconvenient and time intensive. 

Abouzied, et al. (1994) describes a method of 
printing horizontal lines of antibodies on a 
nitrocellulose membrane and separating regions of the 
membrane with vertical stripes of a hydrophobic 
material. Each vertical stripe is then reacted with a 
different antigen and the reaction between the 
immobilized antibody and an antigen is detected using a 
standard ELISA color imetric technique. Abouzied 's 
technique makes it possible to screen many one- 
dimensional arrays simultaneously on a single sheet of 
nitrocellulose. Abouzied makes the nitrocellulose 
somewhat hydrophobic using a line drawn with PAP Pen 
(Research Products International) . However Abouzied 
does not describe a technology that is capable of 
completely sealing the pores of the nitrocellulose. The 
pores of the nitrocellulose are still physically open 
and so the assay reagents can leak through the 
hydrophobic barrier during extended high temperature 
incubations or in the presence of detergents which 
makes the Abouzied technique unacceptable for DNA 
hybridization assays. 

Porous membranes with printed patterns of 
hydrophilic/hydrophobic regions exist for applications 
such as ordered arrays of bacteria colonies, qa Life 
Sciences (San Diego CA) makes such a membrane with a 
grid pattern printed on it. However, this membrane has 
the same disadvantage as the Abouzied technique since 
reagents can still flow between the gridded arrays 
making them unusable for separate DNA hybridization 
assays . 

Pall Corporation make a 96-well plate with a 
porous filter heat sealed to the bottom of the plate. 
35 These plates are capable of containing different 
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reagents in each well without cross-contamination. 
However, each well is intended to hold only one target 
element whereas the invention described here makes a 
microarray of many biomolecules in each subdivided 
region of the solid support. Furthermore, the 96 well 
Plates are at least 1 cm thick and prevent the use of 
the device for many colorimetric, fluorescent and 
radioactive detection formats which require that the 
membrane lie flat against the detection surface. The 
invention described here requires no further processing 
after the assay step since the barriers elements are 
shallow and do not interfere with the detection step 
thereby greatly increasing convenience . 

Hyseq Corporation has described a method of making 
an "array of arrays" on a non-porous solid support for 
use with their sequencing by hybridization technique. 
The method described by Hyseq involves modifying the 
chemistry of the solid support material to form a 
hydrophobic grid pattern where each subdivided region 
contains a microarray of biomolecules. Hyseq 's flat 
hydrophobic pattern does not make use of physical 
blocking as an additional means of preventing cross 
contamination . 

25 """""T of tha Tnventioti 

The invention includes, in one aspect, a method of 
forming a microarray of analyte-assay regions on a 
solid support, where each region in the array has a 
known amount of a selected, analyte-specif ic reagent. 

30 The method involves first loading a solution of a 
selected analyte-specif ic reagent in a reagent- 
dispensing device having an elongate capillary channel 
(i) formed by spaced-apart, coextensive elongate 
members, (ii) adapted to hold a quantity of the reagent 

3 5 solution and (iii) having a tip region at which aqueous 
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solution in the channel forms a meniscus. The channel 
is preferably formed by a pair of spaced-apart tapered 
elements. 

The tip of the dispensing device is tapped against 
5 a solid support at a defined position on the support 

surface with an impulse effective to break the meniscus 
in the capillary channel deposit a selected volume of 
solution on the surface, preferably a selected volume 
in the range 0.01 to 100 nl. The two steps are 

10 repeated until the desired array is formed. 

The method may be practiced in forming a plurality 
of such arrays, where the solution-depositing step is 
are applied to a selected position on each of a 
plurality of solid supports at each repeat cycle. 

15 The dispensing device may be loaded with a new 

solution, by the steps of (i) dipping the capillary 
channel of the device in a wash solution, (ii) removing 
wash solution drawn into the capillary channel, and 
(iii) dipping the capillary channel into the new 

20 reagent solution. 

Also included in the invention is an automated 
apparatus for forming a microarray of analyte-assay 
regions on a plurality of solid supports, where each 
region in the array has a known amount of a selected, 

25 analyte-specif ic reagent. The apparatus has a holder 
for holding, at known positions, a plurality of planar 
supports, and a reagent dispensing device of the type 
described above. 

The apparatus further includes positioning 

3 0 structure for positioning the dispensing device at a 
selected array position with respect to a support in 
said holder, and dispensing structure for moving the 
dispensing device into tapping engagement against a 
support with a selected impulse effective to deposit a 
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selected volume on the support, e.g., a selected volume 
in the volume range 0.01 to 100 nl. 

The positioning and dispensing structures are 
controlled by a control unit in the apparatus. The 
* unit operates to (i) place the dispensing device at a 
loading station, (ii) move the capillary channel in the 
device into a selected reagent at the loading station, 
to load the dispensing device with the reagent, and 
(iii) dispense the reagent at a defined array position 
10 on each of the supports on said holder. The unit may 
further operate, at the end of a dispensing cycle, to 
wash the dispensing device by (i) placing the 
dispensing device at a washi 

-* — — — r \ / v wuc 

capillary channel in the device into a wash fluid, to 
load the dispensing device with the fluid, and (iii) 
remove the wash fluid prior to loading the dispensing 
device with a fresh selected reagent. 

The dispensing device in the apparatus may be one 
of a plurality of such devices which are carried on the 
arm for dispensing different analyte assay reagents at 
selected spaced array positions. 

In another aspect, the invention includes a 
substrate with a surface having a microarray of at 
least 10 3 distinct polynucleotide or polypeptide 
biopolymers in a surface area of less than about 1 cm 2 . 
Each distinct biopolymer (i) is disposed at a separate, 
defined position in said array, (ii) has a length of at 
least 50 subunits, and (iii) is present in a defined 
amount between about 0.1 femtomoles and 100 nanomoles. 

In one embodiment, the surface is glass slide 
surface coated with a polycationic polymer, such as 
poly lysine, and the biopolymers are polynucleotides. 
In another embodiment, the substrate has a water- 
impermeable backing, a water-permeable film formed on 
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the backing, and a grid formed on the film. The grid 
is composed of intersecting water- impervious grid 
elements extending from said backing to positions 
raised above the surface of said film, and partitions 
5 the film into a plurality of water-impervious cells. A 
biopolymer array is formed within each well. 

More generally, there is provided a substrate for 
use in detecting binding of labeled polynucleotides to 
one or more of a plurality different-sequence, 

10 immobilized polynucleotides. The substrate includes, 
in one aspect, a glass support, a coating of a 
polycationic polymer, such as poly lysine, on said 
surface of the support, and an array of distinct 
polynucleotides electrostatically bound non-covalently 

15 to said coating, where each distinct biopolymer is 

disposed at a sepaxate, defined position in a surface 
array of polynucleotides. 

In another aspect, the substrate includes a water- 
impermeable backing, a water-permeable film formed on 

20 the backing, and a grid formed on the film, where the 
grid is composed of intersecting water- impervious grid 
elements extending from the backing to positions raised 
above the surface of the film, forming a plurality of 
cells. A biopolymer array is formed within each cell. 

25 Also forming part of the invention is a method of 

detecting differential expression of each of a 
plurality of genes in a first cell type, with respect 
to expression of the same genes in a second cell type. 
In practicing the method, there is first produced 

3 0 fluorescent-labeled cDNA's from mRNA's isolated from 
the two cells types, where the cDNA'S from the first 
and second cells are labeled with first and second 
different fluorescent reporters. 

A mixture of the labeled cDNA's from the two cell 

35 types is added to an array of polynucleotides 
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representing a plurality of known genes derived from 
the two cell types, under conditions that result in 
hybridization of the cDNA's to complementary-sequence 
polynucleotides in the array. The array is then 
5 examined by fluorescence under fluorescence excitation 
conditions in which (i) polynucleotides in the array 
that are hybridized predominantly to cDNA's derived 
from one of the first and second cell types give a 
distinct first or second fluorescence emission color, 
10 respectively, and (ii) polynucleotides in the array 

that are hybridized to substantially equal numbers of 
cDNA's derived from the first and second cell types 
give a distinct combinpH -ft 

" — — — — » . ^ *.*tt*biWAWll , 

respectively. The relative expression of known genes 
15 in the two cell types can then be determined by the 

observed fluorescence emission color of each spot. 
These and other objects and features of the 

invention will become more fully apparent when the 

following detailed description of the invention is read 
20 in conjunction with the accompanying figures. 

Brief Description of the Drawings 

Fig. 1 is a side view of a reagent -dispensing 
device having a open-capillary dispensing head 
25 constructed for use in one embodiment of the invention; 

Figs. 2A-2C illustrate steps in the delivery of a 
fixed-volume bead on a hydrophobic surface employing 
the dispensing head from Fig. l, in accordance with one 
embodiment of the method of the invention; 
30 Fig. 3 shows a portion of a two-dimensional array 

of analyte-assay regions constructed according to the 
method of the invention; 

Fig. 4 is a planar view showing components of an 
automated apparatus for forming arrays in accordance 
35 with the invention. 
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Fig. 5 shows a fluorescent image of an actual 20 x 
20 array of 400 f luorescently-labeled DNA samples 
immobilized on a poly-l-lysine coated slide, where the 
total area covered by the 4 00 element array is 16 
5 square millimeters; 

Fig. 6 is a fluorescent image of a 1.8 cm x i.s cm 
microarray containing lambda clones with yeast inserts, 
the fluorescent signal arising from the hybridization 
to the array with approximately half the yeast genome 

10 labeled with a green f luorophore and the other half 
with a red f luorophore; 

Fig. 7 shows the translation of the hybridization 
image of Fig. € into a karyotype of the yeast genome, 
where the elements of Fig. -6 microarray contain yeast 

15 DNA sequences that have been previously physically 
mapped in the yeast genome; 

Fig. 8 show a fluorescent image of a 0.5 cmx 0.5 
cm microarray of 24 cDNA clones, where the microarray 
was hybridized simultaneously with total cDNA from wild 

20 type AraJbidopsis plant labeled with a green fluorophore 
and total cDNA from a transgenic Arabidopsis plant 
labeled with a red fluorophore, and the arrow points to 
the cDNA clone representing the gene introduced into 
the transgenic AraJbidopsis plant; 

25 Fig. 9 shows a plan view of substrate having an 

array of cells formed by barrier elements in the form 
of a grid; 

Fig. 10 shows an enlarged plan view of one of the 
cells in the substrate in Fig. 9, showing an array of 
30 polynucleotide regions in the cell; 

Fig. 11 is an enlarged sectional view of "the 
substrate in Fig. 9, taken along a section line in that 
f igure ; and 

Fig. 12 is a scanned image of a 3 cm x 3 cm 
35 nitrocellulose solid support containing four identical 
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arrays of M13 clones in each of four quadrants, where 
each quadrant was hybridized simultaneously to a 
different oligonucleotide using an open face 
hybridization method. 

5 

Detailed Description of the Invention 

I- Definitions 

Unless indicated otherwise, the terms defined 
below have the following meanings: 

!0 "Ligand" refers to one member of a ligand/ anti- 

ligand binding pair. The ligand may be, for example, 
one of the nucleic acid strands in a complementary, 
hybridized nucleic acid duplex binding paij: an 
effector molecule in an effector /receptor binding pair; 

15 or an antigen in an antigen/ antibody or 
antigen/antibody fragment binding pair. 

"Antiligand" refers to the opposite member of a 
ligand/anti-ligand binding pair. The antiligand may be 
the other of the nucleic acid strands in a 

20 complementary, hybridized nucleic acid duplex binding 
pair; the receptor molecule in an effector /receptor 
binding pair; or an antibody or antibody fragment 
molecule in antigen/ antibody or antigen/antibody 
fragment binding pair, respectively. 

25 "Analyte" or "analyte molecule" refers to a 

molecule, typically a macromolecule, such as a 
polynucleotide or polypeptide, whose presence, amount, 
and/or identity are to be determined. The analyte is 
one member of a ligand/anti-ligand pair. 

30 "Analyte-specif ic assay reagent" refers to a 

molecule effective to bind specifically to an ^analyte 
molecule. The reagent is the opposite member of a 
ligand/anti-ligand binding pair. 

An "array of regions on a solid support" is a 

35 linear or two-dimensional array of preferably discrete 
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regions, each having a finite area, formed on the 
surface of a solid support. 

A "microarray" is an array of regions having a 
density of discrete regions of at least about 100/cm 2 , 
and preferably at least about 1000/ cm 2 . The regions in 
a microarray have typical dimensions, e.g., diameters, 
in the range of between about 10-250 /xm, and are 
separated from other regions in the array by about the 
same distance. 

A support surface is "hydrophobic" if a aqueous- 
medium droplet applied to the surface does not spread 
out substantially beyond the area size of the applied 

dronl £»t" . Thaf -i e? j__ _ 

* — — — — w , ****w aoua lu prevent 

spreading of the droplet applied to the surface by 
15 hydrophobic interaction with the droplet. 

A "meniscus" means a concave or convex surface 
that forms on the bottom of a liquid in a channel as a 
result of the surface tension of the liquid. 

"Distinct biopolymers" , as applied to the 
biopolymers forming a microarray, means an array member 
which is distinct from other array members on the basis 
of a different biopolymer sequence, and/or different 
concentrations of the same or distinct biopolymers, 
and/or different mixtures of distinct or different- 
25 concentration biopolymers. Thus an array of "distinct 
polynucleotides" means an array containing, as its 
members, (i) distinct polynucleotides, which may have a 
defined amount in each member, (ii) different, graded 
concentrations of given-sequence polynucleotides, 
30 and/or (iii) different-composition mixtures of two or 
more distinct polynucleotides. ; 

"Cell type" means a cell from a given source, 
e.g., a tissue, or organ, or a cell in a given state of 
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differentiation, or a cell associated with a given 
pathology or genetic makeup. 

11 • Method of Microar rav Formation 

This section describes a method of forming a 
microarray of analyte-assay regions on a solid support 
or substrate, where each region in the array has a 
known amount of a selected, analyte-specif ic reagent. 

Fig. 1 illustrates, in a partially schematic view, 
a reagent-dispensing device 10 useful in practicing the 
method . The device generally includes a reagent 
dispenser 12 having an elongate open capillary channel 
14 adapted to hold a quantity of the reagent solution, 
such as indicated at 16, as will be described below. 
The capillary channel is formed by a pair of spaced- 
apart, coextensive, elongate members 12a, 12b which are 
tapered toward one another and converge at a tip or tip 
region 18 at the lower end of the channel. More 
generally, the open channel is formed by at least two 
elongate, spaced-apart members adapted to hold a 
quantity of reagent solutions and having a tip region 
at which aqueous solution in the channel forms a 
meniscus, such as the concave meniscus illustrated at 
20 in Fig. 2A. The advantages of the open channel 
construction of the dispenser are discussed below. 

With continued reference to Fig. 1, the dispenser 
device also includes structure for moving the dispenser 
rapidly toward and away from a support surface, for 
effecting deposition of a known amount of solution in 
the dispenser on a support, as will be described below 
with reference to Figs. 2A-2C. In the embodiment 
shown, this structure includes a solenoid 22 which is 
activatable to draw a solenoid piston 24 rapidly 
downwardly, then release the piston, e.g., under spring 
bias, to a normal, raised position, as shown. The 
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dispenser is carried on the piston by a connecting 
member 26 , as shown. The just-described moving 
structure is also referred to herein as dispensing 
means for moving the dispenser into engagement with a 
5 solid support, for dispensing a known volume of fluid 
on the support. 

The dispensing device just described is carried on 
an arm 28 that may be moved either linearly or in an x- 
y plane to position the dispenser at a selected 

10 deposition position, as will be described. 

Figs. 2A-2C illustrate the method of depositing a 
known amount of reagent solution in the just-described 
dxspenser on the surface of a solid support/ such as 
the support indicated at 30. The support is a polymer, 

15 glass, or other solid-material support having a surface 
indicated at 31. 

In one general embodiment, the surface is a 
relatively hydrophilic, i.e., wettable surface, such as 
a surface having native, bound or covalently attached 

20 charged groups. On such surface described below is a 
glass surface having an absorbed layer of a 
polycationic polymer, such as poly-l-lysine. 

In another embodiment, the surface has or is 
formed to have a relatively hydrophobic character, 

25 i.e., one that causes aqueous medium deposited on the 
surface to bead. A variety of known hydrophobic 
polymers, such as polystyrene, polypropylene, or 
polyethylene have desired hydrophobic properties, as do 
glass and a variety of lubricant or other hydrophobic 

3 0 films that may be applied to the support surface. 

Initially, the dispenser is loaded with ar selected 
analyte-specif ic reagent solution, such as by dipping 
the dispenser tip, after washing, into a solution of 
the reagent, and allowing filling by capillary flow 

3 5 into the dispenser channel. The dispenser is now moved 
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to a selected position with respect to a support 
surface, placing the dispenser tip directly above the 
support-surface position at which the reagent is to be 
deposited. This movement takes place with the 
5 dispenser tip in its raised position, as seen in Fig. 
2A, where the tip is typically at least several 1-5 mm 
above the surface of the substrate. 

With the dispenser so positioned, solenoid 22 is 
now activated to cause the dispenser tip to move 

10 rapidly toward and away from the substrate surface, 
making momentary contact with the surface, in effect, 
tapping the tip of the dispenser against the support 
surface. The tapping movement of the tip against the 
surface acts to break the liquid meniscus in the tip 

15 channel, bringing the liguid in the tip into contact 
with the support surface. This, in turn, produces a 
flowing of the liquid into the capillary space between 
the tip and the surface, acting to draw liquid out of 
the dispenser channel, as seen in Fig. 2B. 

20 Fig. 2C shows flow of fluid from the tip onto the 

support surface, which in this case is a hydrophobic 
surface. The figure illustrates that liquid continues 
to flow from the dispenser onto the support surface 
until it forms a liquid bead 32. At a given bead size, 

25 i.e., volume, the tendency of liquid to flow onto the 
surface will be balanced by the hydrophobic surface 
interaction of the bead with the support surface, which 
acts to limit the total bead area on the surface, and 
by the surface tension of the droplet, which tends 

3 0 toward a given bead curvature. At this point,, a given 
bead volume will have formed, and continued contact of 
the dispenser tip with the bead, as the dispenser tip 
is being withdrawn, will have little or no effect on 
bead volume. 
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For liquid-dispensing on a more hydrophilic 
surface, the liquid will have less of a tendency to 
bead, and the dispensed volume will be more sensitive 
to the total dwell time of the dispenser tip in the 
5 immediate vicinity of the support surface, e.g., the 
positions illustrated in Figs. 2B and 2C. 

The desired deposition volume, i.e., bead volume, 
formed by this method is preferably in the range 2 pi 
(picoliters) to 2 nl (nanoliters) , although volumes as 

10 high as 100 nl or more may be dispensed. It will be 
appreciated that the selected dispensed volume will 
depend on (i) the "footprint" of the dispenser tip, 
-i • c . , the 5iZ£ of the area spanned t>y the tip, ( ii ) tiie 
hydrophobicity of the support surface, and (iii) the 

15 time of contact with and rate of withdrawal of the tip 
from the support surface. In addition, bead size may 
be reduced by increasing the viscosity of the medium, 
effectively reducing the flow time of liquid from the 
dispenser onto the support surface. The drop size may 

20 be further constrained by depositing the drop in a 
hydrophilic region surrounded by a hydrophobic grid 
pattern on the support surface. 

In a typical embodiment, the dispenser tip is 
tapped rapidly against the support surface, with a 

25 total residence time in contact with the support of 
less than about 1 msec, and a rate of upward travel 
from the surface of about 10 cm/ sec. 

Assuming that the bead that forms on contact with 
the surface is a hemispherical bead, with a diameter 

30 approximately equal to the width of the dispenser tip, 
as shown in Fig. 2C, the volume of the bead folrmed in 
relation to dispenser tip width (d) is given in Table 1 
below. As seen, the volume of the bead ranges between 
2 pi to 2 nl as the width size is increased from about 

35 20 to 200 /im. 
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L— ■ 


Volume (nl) 


20 


2 x io* 3 


50 


3.1 x 10' 2 


100 Mm 


2.5 x 10 1 


1 200 jra 


2 



35 



At a given tip size, bead volume can be reduced in 
a controlled fashion by increasing surface 
hydrophobicity, reducing time of contact of the tip 
with the surface, increasing rate of movement of the 
tip away from the surface, and/or increasing the 
viscosity of the medium. Once these parameters are 
fixed, a selected deposition volume in the desired pi 
to nl range can be achieved in a repeatable fashion. 

After depositing a bead at one selected location 
on a support, the tip is typically moved to a 
corresponding position on a second support, a droplet 
is deposited at that position, and this process is 
repeated until a liquid droplet of the reagent has been 
deposited at a selected position on each of a plurality 
of supports. 

The tip is then washed to remove the reagent 
liquid, filled with another reagent liquid and this 
reagent is now deposited at each another array position 
on each of the supports. In one embodiment, the tip is 
washed and refilled by the steps of (i) dipping the 
capillary channel of the device in a wash solution, 
(ii) removing wash solution drawn into the capillary 
channel, and (iii) dipping the capillary channel into 
the new reagent solution. 

From the foregoing, it will be appreciated that 
the tweezers-like, open-capillary dispenser tip 
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provides the advantages that (i) the open channel of 
the tip facilitates rapid, efficient washing and drying 
before reloading the tip with a new reagent, (ii) 
passive capillary action can load the sample directly 
5 from a standard microwell plate while retaining 

sufficient sample in the open capillary reservoir for 
the printing of numerous arrays, (iii) open capillaries 
are less prone to clogging than closed capillaries, and 
(iv) open capillaries do not require a perfectly faced 

10 bottom surface for fluid delivery. 

A portion of a microarray 36 formed on the surface 
38 of a solid support 40 in accordance with the method 
just described is shown in Fig. 3. The array is formed 
of a plurality of analyte-specif ic reagent regions, 

15 such as regions 42, where each region may include a 
different analyte-specif ic reagent . As indicated 
above, the diameter of each region is preferably 
between about 20-200 /im. The spacing between each 
region and its closest (non-diagonal) neighbor, 

20 measured from center-to-center (indicated at 44), is 

preferably in the range of about 20-400 fin. Thus, for 
example, an array having a center-to-center spacing of 
about 250 j*m contains about 40 regions/cm or 1,600 
regions /cm 2 . After formation of the array, the support 

25 is treated to evaporate the liquid of the droplet 

forming each region, to leave a desired array of dried, 
relatively flat regions. This drying may be done by 
heating or under vacuum. 

In some cases, it is desired to first rehydrate 

3 0 the droplets containing the analyte reagents to allow 
for more time for adsorption to the solid support. It 
is also possible to spot out the analyte reagents in a 
humid environment so that droplets do not dry until the 
arraying operation is complete. 



WO 95/35505 



PCT/US95/07659 



19 

111 - Automated Apparatus for F orming Arr^y g 

In another aspect, the invention includes an 
automated apparatus for forming an array of analyte- 
assay regions on a solid support, where each region in 
5 the array has a known amount of a selected, analyte- 
specific reagent. 

The apparatus is shown in planar, and partially 
schematic view in Fig. 4. A dispenser device 72 in the 
apparatus has the basic construction described above 
10 with respect to Fig. 1, and includes a dispenser 74 

having an open-capillary channel terminating at a tip, 
substantially as shown in Figs. 1 and 2A-2C. 

The dispenser is mounted in the device for 
movement toward and away from a dispensing position at 
15 which the tip of the dispenser taps a support surface, 
to dispense a selected volume of reagent solution, as 
described above. This movement is effected by a 
solenoid 76 as described above. Solenoid 76 is under 
the control of a control unit 77 whose operation will 
20 be described below. The solenoid is also referred to 
herein as dispensing means for moving the device into 
tapping engagement with a support, when the device is 
positioned at a defined array position with respect to 
that support. 

25 The dispenser device is carried on an arm 74 which 

is threadedly mounted on a worm screw 8 0 driven 
(rotated) in a desired direction by a stepper motor 82 
also under the control of unit 77. At its left end in 
the figure screw 80 is carried in a sleeve 84 for 

3 0 rotation about the screw axis. At its other end, the 
screw is mounted to the drive shaft of the stepper 
motor, which in turn is carried on a sleeve 86. The 
dispenser device, worm screw, the two sleeves mounting 
the worm screw, and the stepper motor used in moving 

35 the device in the "x" (horizontal) direction in the 
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figure form what is referred to here collectively as a 
displacement assembly 86. 

The displacement assembly is constructed to 
produce precise, micro-range movement in the direction 
5 of the screw, i.e., along an x axis in the figure. In 
one mode, the assembly functions to move the dispenser 
in x-axis increments having a selected distance in the 
range 5-25 jim. In another mode, the dispenser unit may 
be moved in precise x-axis increments of several 

10 microns or more,; for positioning the dispenser at 

associated positions on adjacent supports, as will be 
described below* 

The displacement assembly, in turn- is mounted for 
movement in the "y M (vertical) axis of the figure, for 

15 positioning the dispenser at a selected y axis 

position. The structure mounting the assembly includes 
a fixed rod 88 mounted rigidly between a pair of frame 
bars 90, 92, and a worm screw 94 mounted for rotation 
between a pair of frame bars 96, 98. The worm screw is 

2 0 driven (rotated) by a stepper motor 100 which operates 

under the control of unit 77. The motor is mounted on 
bar 96, as shown. 

The structure just described, including worm screw 
94 and motor 100, is constructed to produce precise, 
25 micro-range movement in the direction of the screw, 
i.e., along an y axis in the figure. As above, the 
structure functions in one mode to move the dispenser 
in y-axis increments having a selected distance in the 
range 5-250 fin, and in a second mode, to move the 

3 0 dispenser in precise y-axis increments of several 

microns (jra) or more, for positioning the dispenser at 
associated positions on adjacent supports. 

The displacement assembly and structure for moving 
this assembly in the y axis are referred to herein 
35 collectively as positioning means for positioning the 
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dispensing device at a selected array position with 
respect to a support. 

A holder 102 in the apparatus functions to hold a 
plurality of supports, such as supports 104 on which 
5 the microarrays of regent regions are to be formed by 
the apparatus. The holder provides a number of 
recessed slots, such as slot 106, which receive the 
supports, and position them at precise selected 
positions with respect to the frame bars on which the 
10 dispenser moving means is mounted. 

As noted above, the control unit in the device 
functions to actuate the two stepper motors and 

dicnencor eAlonni/1 i« a e» a**^.** ^ -i «. «a — a 
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operation of the apparatus in forming a selected 
15 microarray of reagent regions on each of a plurality of 
supports . 

The control unit is constructed, according to 
conventional microprocessor control principles, to 
provide appropriate signals to each of the solenoid and 

20 each of the stepper motors, in a given timed sequence 
and for appropriate signalling time. The construction 
of the unit, and the settings that are selected by the 
user to achieve a desired array pattern, will be 
understood from the following description of a typical 

25 apparatus operation. 

Initially, one or more supports are placed in one 
or more slots in the holder. The dispenser is then 
moved to a position directly above a well (not shown) 
containing a solution of the first reagent to be 

30 dispensed on the support (s). The dispenser solenoid is 
actuated now to lower the dispenser tip into this well, 
causing the capillary channel in the dispenser to fill. 
Motors 82, 100 are now actuated to position the 
dispenser at a selected array position at the first of 

35 the supports. Solenoid actuation of the dispenser is 
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then effective to dispense a selected-volume droplet of 
that reagent at this location. As noted above, this 
operation is effective to dispense a selected volume 
preferably between 2 pi and 2 nl of the reagent 
5 solution. 

The dispenser is now moved to the corresponding 
position at an adjacent support and a similar volume of 
the solution is dispensed at this position. The 
process is repeated until the reagent has been 

10 dispensed at this preselected corresponding position on 
each of the supports. 

Where it is desired to dispense a single reagent 
at more than two array positions On a Support, the 
dispenser may be moved to different array positions at 

15 each support, before moving the dispenser to a new 
support, or solution can be dispensed at individual 
positions on each support, at one selected position, 
then the cycle repeated for each new array position. 
To dispense the next reagent, the dispenser is 

20 positioned over a wash solution (not shown) , and the 
dispenser tip is dipped in and out of this solution 
until the reagent solution has been substantially 
washed from the tip. Solution can be removed from the 
tip, after each dipping, by vacuum, compressed air 

25 spray, sponge, or the like. 

The dispenser tip is now dipped in a second 
reagent well, and the filled tip is moved to a second 
selected array position in the first support. The 
process of dispensing reagent at each of the 

3 0 corresponding second-array positions is then carried as 
above. This process is repeated until an entire 
microarray of reagent solutions on each of the supports 
has been formed. 

3 5 IV. Microarray Substrate 
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This section describes embodiments of a substrate 
having a microarray of biological polymers carried on 
the substrate surface. Subsection A describes a multi- 
cell substrate, each cell of which contains a 
microarray, and preferably an identical microarray, of 
distinct biopolymers, such as distinct polynucleotides, 
formed on a porous surface. Subsection B describes a 
microarray of distinct polynucleotides bound on a glass 
slide coated with a polycat ionic polymer. 



A. Multi-Cel l Substrate 

Fig. 9 illustrates, in plan view, a substrate 110 
constructed according to the invention. The substrate 
has an 8 x 12 rectangular array 112 of cells, such as 

15 cells 114, 116, formed on the substrate surface. With 
reference to Fig. 10, each cell, such as cell 114, in 
turn supports a microarray 118 of distinct biopolymers, 
such as polypeptides or polynucleotides at known, 
addressable regions of the microarray. Two such 

20 regions forming the microarray are indicated at 120, 

and correspond to regions, such as regions 42, forming 
the microarray of distinct biopolymers shown in Fig. 3. 

The 96-cell array shown in Fig. 9 has typically 
array dimensions between about 12 and 244 mm in width 

25 and 8 and 400 mm in length, with the cells in the array 
having width and length dimension of 1/12 and 1/8 the 
array width and length dimensions, respectively, i.e., 
between about 1 and 20 in width and 1 and 50 mm in 
length. 

30 The construction of substrate is shown cross- 

sectionally in Fig. 11, which is an enlarged sectional 
view taken along view line 124 in Fig. 9. The 
substrate includes a water-impermeable backing 126, 
such as a glass slide or rigid polymer sheet. Formed 

35 on the surface of the backing is a water-permeable film 
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128. The film is formed of a porous membrane material, 
such as nitrocellulose membrane, or a porous web 
material, such as a nylon, polypropylene, or PVDF 
porous polymer material. The thickness of the film is 
5 preferably between about 10 and 1000 jxm. The film may 
be applied to the backing by spraying or coating 
uncured material on the backing, or by applying a 
preformed membrane to the backing. The backing and 
film may be obtained as a preformed unit from 

10 commercial source, e.g., a plastic-backed 

nitrocellulose film available from Schleicher and 
Schuell Corporation. 

With continued reference to Fig , 11 . the film- 
covered surface in the substrate is partitioned into a 

15 desired array of cells by water- impermeable grid lines, 
such as lines 130, 132, which have infiltrated the film 
down to the level of the backing, and extend above the 
surface of the film as shown, typically a distance of 
100 to 2000 A*m above the film surface, 

20 The grid lines are formed on the substrate by 

laying down an uncured or otherwise flowable resin or 
elastomer solution in an array grid, allowing the 
material to infiltrate the porous film down to the 
backing, then curing or otherwise hardening the grid 

25 lines to form the cell-array substrate. 

One preferred material for the grid is a flowable 
silicone available from Loctite Corporation. The 
barrier material can be extruded through a narrow 
syringe (e.g., 22 gauge) using air pressure or 

30 mechanical pressure. The syringe is moved relative to 
the solid support to print the barrier elements as a 
grid pattern. The extruded bead of silicone wicks into 
the pores of the solid support and cures to form a 
shallow waterproof barrier separating the regions of 

35 the solid support. 
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In alternative embodiments, the barrier element 
can be a wax-based material or a thermoset material 
such as epoxy. The barrier material can also be a UV- 
curing polymer which is exposed to UV light after being 
5 printed onto the solid support. The barrier material 
may also be applied to the solid support using printing 
techniques such as silk-screen printing. The barrier 
material may also be a heat-seal stamping of the porous 
solid support which seals its pores and forms a water- 

10 impervious barrier element. The barrier material may 
also be a shallow grid which is laminated or otherwise 
adhered to the solid support. 

In addition to plastic-backed nitrocellulose, the 
solid support can be virtually any porous membrane with 

15 or without a non-porous backing. Such membranes are 
readily available from numerous vendors and are made 
from nylon, PVDF, polysulfone and the like. In an 
alternative embodiment, the barrier element may also be 
used to adhere the porous membrane to a non-porous 

20 backing in addition to functioning as a barrier to 
prevent cross contamination of the assay reagents. 

In an alternative embodiment, the solid support 
can be of a non-porous material. The barrier can be 
printed either before or after the microarray of 

25 biomolecules is printed on the solid support. 

As can be appreciated, the cells formed by the 
grid lines and the underlying backing are water- 
impermeable, having side barriers projecting above the 
porous film in the cells. Thus, def ined-volume samples 

30 can be placed in each well without risk of cross- 
contamination with sample material in adjacent cells. 
In Fig. 11, defined volumes samples, such as sample 
134, are shown in the cells. 

As noted above, each well contains a microarray of 

3 5 distinct biopolymers. In one general embodiment, the 
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microarrays in the well are identical arrays of 
distinct biopolymers, e.g., different sequence 
polynucleotides. Such arrays can be formed in 
accordance with the methods described in Section II, by 
5 depositing a first selected polynucleotide at the same 
selected microarray position in each of the cells, then 
depositing a second polynucleotide at a different 
microarray position in each well, and so on until a 
complete, identical microarray is formed in each cell. 
10 In a preferred embodiment, each microarray 

contains about 10 3 distinct polynucleotide or 
polypeptide biopolymers per surface area of less than 
about 1 cm : . Also in a preferred embodiment, the 
biopolymers in each microarray region are present in a 
15 defined amount between about 0.1 femtomoles and 100 

nanomoles. The ability to form high-density arrays of 
biopolymers, where each region is formed of a well- 
defined amount of deposited material, can be achieved 
in accordance with the microarray-forming method 
20 described in Section II. 

Also in a preferred embodiments, the biopolymers 
are polynucleotides having lengths of at least about 50 
bp, i.e., substantially longer than oligonucleotides 
which can be formed in high-density arrays by schemes 
25 involving parallel, step-wise polymer synthesis on the 
array surface. 

In the case of a polynucleotide array, in an assay 
procedure, a small volume of the labeled DNA probe 
mixture in a standard hybridization solution is loaded 
3 0 onto each cell. The solution will spread to cover the 
entire microarray and stop at the barrier elements. 
The solid support is then incubated in a humid chamber 
at the appropriate temperature as required by the 
assay • 
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Each assay may be conducted in an "open-face" 
format where no further sealing step is required, since 
the hybridization solution will be kept properly 
hydrated by the water vapor in the humid chamber- At 
5 the conclusion of the incubation step, the entire solid 
support containing the numerous microarrays is rinsed 
quickly enough to dilute the assay reagents so that no 
significant cross contamination occurs. The entire 
solid support is then reacted with detection reagents 

10 if needed and analyzed using standard color imetric, 
radioactive or fluorescent detection means. All 
processing and detection steps are performed 
simultaneously to all of the microarrays on the solid 
support ensuring uniform assay conditions for all of 

15 the microarrays on the solid support. 

B. Glass-Slide Polynucleotide Array 
Fig. 5 shows a substrate 136 formed according to 
another aspect of the invention, and intended for use 

20 in detecting binding of labeled polynucleotides to one 
or more of a plurality distinct polynucleotides. The 
substrate includes a glass substrate 138 having formed 
on its surface, a coating of a polycat ionic polymer, 
preferably a cationic polypeptide, such as poly lysine 

25 or polyarginine. Formed on the polycat ionic coating is 
a microarray 140 of distinct polynucleotides, each 
localized at known selected array regions, such as 
regions 142. 

The slide is coated by placing a uniform-thickness 
30 film of a polycationic polymer, e.g., poly-l-lysine, on 
the surface of a slide and drying the film to ^ form a 
dried coating. The amount of polycationic polymer 
added is sufficient to form at least a monolayer of 
polymers on the glass surface. The polymer film is 
3 5 bound to surface via electrostatic binding between 
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negative silyl-OH groups on the surface and charged 
amine groups in the polymers. Poly-l-lysine coated 
glass slides may be obtained commercially, e.g., from 
Sigma Chemical Co. (St. Louis, MO). 
5 To form the microarray, defined volumes of 

distinct polynucleotides are deposited on the polymer- 
coated slide, as described in Section II. According to 
an important feature of the substrate, the deposited 
polynucleotides remain bound to the coated slide 

10 surface non-covalently when an aqueous DNA sample is 
applied to the substrate under conditions which allow 
hybridization of reporter-labeled polynucleotides in 
the sample to complementary-sequence (single-stranded) 
polynucleotides in the substrate array. The method is 

15 illustrated in Examples 1 and 2. 

To illustrate this feature, a substrate of the 
type just described, but having an array of same- 
sequence polynucleotides, was mixed with fluorescent- 
labeled complementary DNA under hybridization 

20 conditions. After washing to remove non-hybridized 
material, the substrate was examined by low-power 
fluorescence microscopy. The array can be visualized 
by the relatively uniform labeling pattern of the array 
regions . 

25 In a preferred embodiment, each microarray 

contains at least 10 3 distinct polynucleotide or 
polypeptide biopolymers per surface area of less than 
about 1 cm 2 . In the embodiment shown in Fig. 5, the 
microarray contains 400 regions in an area of about 16 

30 mm 2 , or 2.5 x 10 3 regions/cm 2 - Also in a preferred 

embodiment, the polynucleotides in the each microarray 
region are present in a defined amount between about 
0.1 femtomoles and 100 nanomoles in the case of 
polynucleotides. As above, the ability to form high- 
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density arrays of this type, where each region is 
formed of a well-defined amount of deposited material, 
can be achieved in accordance with the microarray- 
forming method described in Section II. 
5 Also in a preferred embodiments, the 

polynucleotides have lengths of at least about 50 bp, 
i.e., substantially longer than oligonucleotides which 
can be formed in high-density arrays by various in situ 
synthesis schemes. 

10 

V. Utility 

Microarrays of immobilized nucleic acid sequences 
prepared in accordance with the invention can be used 
for large scale hybridization assays in numerous 

15 genetic applications, including genetic and physical 

mapping of genomes, monitoring of gene expression, DNA 
sequencing, genetic diagnosis, genotyping of organisms, 
and distribution of DNA reagents to researchers. 

For gene mapping, a gene or a cloned DNA fragment 

20 is hybridized to an ordered array of DNA fragments, and 
the identity of the DNA elements applied to the array 
is unambiguously established by the pixel or pattern of 
pixels of the array that are detected. One application 
of such arrays for creating a genetic map is described 

25 by Nelson, et al . (1993). In constructing physical 
maps of the genome, arrays of immobilized cloned DNA 
fragments are hybridized with other cloned DNA 
fragments to establish whether the cloned fragments in 
the probe mixture overlap and are therefore contiguous 

3 0 to the immobilized clones on the array. For example, 
Lehrach, et al . , describe such a process. 

The arrays of immobilized DNA fragments may also 
be used for genetic diagnostics. To illustrate, an 
array containing multiple forms of a mutated gene or 

3 5 genes can be probed with a labeled mixture of a 
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patient's DNA which will preferentially interact with 
only one of the immobilized versions of the gene. 

The detection of this interaction can lead to a 
medical diagnosis. Arrays of immobilized DNA fragments 
5 can also be used in DNA probe diagnostics. For 

example, the identity of a pathogenic microorganism can 
be established unambiguously by hybridizing a sample of 
the unknown pathogen's DNA to an array containing many 
types of known pathogenic DNA. A similar technique can 
10 also be used for junambiguous genotyping of any 

organism. Other molecules of genetic interest, such as 
cDNA's and RNA's can be immobilized on the array or 

aiuexuaueiy Udcu a» uxic= mwcxcu 

applied to the array. 

15 In one application, an array of cDNA clones 

representing genes is hybridized with total cDNA from 
an organism to monitor gene expression for research or 
diagnostic purposes. Labeling total cDNA from a normal 
cell with one color fluorophore and total cDNA from a 

20 diseased cell with another color fluorophore and 

simultaneously hybridizing the two cDNA samples to the 
same array of cDNA clones allows for differential gene 
expression to be measured as the ratio of the two 
fluorophore intensities. This two-color experiment can 

25 be used to monitor gene expression in different tissue 
types, disease states, response to drugs, or response 
to environmental factors. & An example of this approach 
is illustrated in Examples 2, described with respect to 
Fig. 8. 

3 0 By way of example and without implying a 

limitation of scope, such a procedure could bemused to 
simultaneously screen many patients against all known 
mutations in a disease gene. This invention could be 
used in the form of, for example, 96 identical 0.9 cm x 

35 2.2 cm microarrays fabricated on a single 12 cm x 18 cm 
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sheet of plastic-backed nitrocellulose where each 
microarray could contain, for example, 100 DNA 
fragments representing all known mutations of a given 
gene. The region of interest from each of the DNA 
5 samples from 96 patients could be amplified, labeled, 
and hybridized to the 96 individual arrays with each 
assay performed in 100 microliters of hybridization 
solution. The approximately 1 thick silicone rubber 
barrier elements between individual arrays prevent 

10 cross contamination of the patient samples by sealing 
the pores of the nitrocellulose and by acting as a 
physical barrier between each microarray. The solid 
— rr wj - *- ^wnwaj.iiAay an iuiuiudiidys assayed wi*cn *cne 
9 6 patient samples is incubated, rinsed, detected and 

15 analyzed as a single sheet of material using standard 
radioactive, fluorescent, or color imetric detection 
means (Maniatas, et al., 1989), Previously, such a 
procedure would involve the handling, processing and 
tracking of 96 separate membranes in 96 separate sealed 

20 chambers. By processing all 96 arrays as a single 

sheet of material, significant time and cost savings 
are possible. 

The assay format can be reversed where the patient 
or organism's DNA is immobilized as the array elements 

25 and each array is hybridized with a different mutated 
allele or genetic marker. The gridded solid support 
can also be used for parallel non-DNA ELISA assays. 
Furthermore, the invention allows for the use of all 
standard detection methods without the need to remove 

30 the shallow barrier elements to carry out the detection 
step . 

In addition to the genetic applications listed 
above, arrays of whole cells, peptides, enzymes, 
antibodies, antigens, receptors, ligands, 
3 5 phospholipids, polymers, drug cogener preparations or 
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described in this invention for large scale screening 
assays in medical diagnostics, drug discovery, 
molecular biology, immunology and toxicology • 
5 The multi-cell substrate aspect of the invention 

allows for the rapid and convenient screening of many 
DNA probes against many ordered arrays of DNA 
fragments. This eliminates the need to handle and 
detect many individual arrays for performing mass 
10 screenings for genetic research and diagnostic 

applications. Numerous microarrays can be fabricated 
on the same solid support and each microarray reacted 
with a different DNA probe while the solid support is 
processed as a single sheet of material. 

15 

The following examples illustrate, but in no way 
are intended to limit, the present invention. 

Example 1 

20 Genomic-Complexitv Hybridizatio n to Micro 

DNA Arrays Representing the Yeast 
SaccharomycBS cerevisiae Genome with 
Two-Color Fluorescent Detection 

The array elements were randomly amplified PCR 

25 (Bohlander, et al., 1992) products using physically 

mapped lambda clones of S. cerevisiae genomic DNA 

templates (Riles, et al . , 1993). The PCR was performed 

directly on the lambda phage lysates resulting in an 

amplification of both the 35 kb lambda vector and the 

30 5-15 kb yeast insert sequences in the form of a uniform 

distribution of PCR product between 2 50-1500 base pairs 

in length. The PCR product was purified using 

Sephadex G50 gel filtration (Pharmacia, Piscataway, NJ) 

and concentrated by evaporation to dryness at room 

35 temperature overnight. Each of the 8 64 amplified 
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lambda clones was rehydrated in 15 pi of 3 x SSC in 
preparation for spotting onto the glass. 

The micro arrays were fabricated on microscope 
slides which were coated with a layer of poly-l-lysine 
5 (Sigma) . The automated apparatus described in Section 
IV loaded 1 pi of the concentrated lambda clone PCR 
product in 3 x SSC directly from 96 well storage plates 
into the open capillary printing element and deposited 
-5 nl of sample per slide at 380 micron spacing between 

10 spots, on each of 40 slides. The process was repeated 
for all 864 samples and 8 control spots. After the 
spotting operation was complete, the slides were 
rehydrated in a humid chamber for 2 hours, baxed in a 
dry 80° vacuum oven for 2 hours, rinsed to remove un- 

15 absorbed DNA and then treated with succinic anhydride 
to reduce non-specific adsorption of the labeled 
hybridization probe to the poly-l-lysine coated glass 
surface. Immediately prior to use, the immobilized DNA 
on the array was denatured in distilled water at 90° 

20 for 2 minutes. 

For the pooled chromosome experiment, the 16 
chromosomes of Saccharomyces cerevisiae were separated 
in a CHEF agarose gel apparatus (Biorad, Richmond, CA) • 
The six largest chromosomes were isolated in one gel 

25 slice and the smallest 10 chromosomes in a second gel 
slice. The DNA was recovered using a gel extraction 
kit (Qiagen, Chatsworth, CA) . The two chromosome pools 
were randomly amplified in a manner similar to that 
used for the target lambda clones. Following 

3 0 amplification, 5 micrograms of each of the amplified 

chromosome pools were separately random-primed labeled 
using Klenow polymerase (Amersham, Arlington Heights, 
IL) with a lissamine conjugated nucleotide analog 
(Dupont NEN, Boston, MA) for the pool containing the 

35 six largest chromosomes, and with a fluorescein 
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conjugated nucleotide analog (BMB) for the pool 
containing smallest ten chromosomes. The two pools 
were mixed and concentrated using an ultrafiltration 
device (Amicon, Danvers, MA) • 
5 Five micrograms of the hybridization probe 

consisting of both chromosome pools in 7.5 /xl of TE was 
denatured in a boiling water bath and then snap cooled 
on ice. 2.5 pi of concentrated hybridization solution 
(5 x SSC and 0.1% SDS) was added and all 10 pi 
10 transferred to the array surface, covered with a cover 
slip, placed in a custom-built single-slide humidity 
chamber and incubated at 60° for 12 hours. The slides 

0.1%SDS for 5 minutes, cover slipped and scanned. 

15 A custom built laser fluorescent scanner was used 

to detect the two-color hybridization signals from the 
1.8 x 1.8 cm array at 20 micron resolution. The 
scanned image was gridded and analyzed using custom 
image analysis software. After correcting for optical 

20 crosstalk between the fluorophores due to their 
overlapping emission spectra, the red and green 
hybridization values for each clone on the array were 
correlated to the known physical map position of the 
clone resulting in a computer-generated color karyotype 

25 of the yeast genome. 

Figure 6 shows the hybridization pattern of the 
two chromosome pools. A red signal indicates that the 
lambda clone on the array surface contains a cloned 
genomic DNA segment from one of the largest six yeast 

3 0 chromosomes. A green signal indicates that the lambda 
clone insert comes from one of the smallest ten yeast 
chromosomes. Orange signals indicate repetitive 
sequences which cross hybridized to both chromosome 
pools. Control spots on the array confirm that the 

35 hybridization is specific and reproducible. 
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The physical map locations of the genomic DNA 
fragments contained in each of the clones used as array 
elements have been previously determined by Olson and 
co-workers (Riles, et al . ) allowing for the automatic 
5 generation of the color karyotype shown in Figure 7. 
The color of a chromosomal section on the karyotype 
corresponds to the color of the array element 
containing the clone from that section. The black 
regions of the karyotype represent false negative dark 

10 spots on the array (10%) or regions of the genome not 
covered by the Olson clone library (90%) . Note that 
the largest six chromosomes are mainly red while the 
smallest ten chromosomes are mainly green matching the 
original CHEF gel isolation of the hybridization probe. 

15 Areas of the red chromosomes containing green spots and 
vice-versa are probably due to spurious sample tracking 
errors in the formation of the original library and in 
the amplification and spotting procedures. 

The yeast genome arrays have also been probed with 

2 0 individual clones or pools of clones that are 

f luorescently labeled for physical mapping purposes. 
The hybridization signals of these clones to the array 
were translated into a position on the physical map of 
yeast . 

25 

Example 2 

Total cDNA Hybridized to Micro Arrays of 
cDNA Clones with Two-Color 
Fluorescent Detection 

30 24 clones containing cDNA inserts from the plant 

Arabidopsis were amplified using PCR. Salt was added 
to the purified PCR products to a final concentration 
of 3 x SSC. The cDNA clones were spotted on poly-1- 
lysine coated microscope slides in a manner similar to 

35 Example 1. Among the cDNA clones was a clone 
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representing a transcription factor HAT 4, which had 
previously been used to create a transgenic line of the 
plant Arabidopsis , in which this gene is present at ten 
times the level found in wild-type Arabidopsis (Schena, 
5 et ah, 1992) . 

Total poly-A mRNA from wild type Arabidopsis was 
isolated using standard methods (Maniatis, et al., 
1989) and reverse transcribed into total cDNA, using 
fluorescein nucleotide analog to label the cDNA product 
10 (green fluorescence) . A similar procedure was 

performed with the transgenic line of Arabidopsis where 
the transcription factor HAT4 was inserted into the 

rranAmA 4-*— «*»#*^**<«» 4* l — «nun 

copies of mRNA from the transgenic plant are labeled 

15 with a lissamine nucleotide analog (red fluorescence) . 
Two micrograms of the cDNA products from each type of 
plant were pooled together and hybridized to the cDNA 
clone array in a 10 microliter hybridization reaction 
in a manner similar to Example 1. Rinsing and 

20 detection of hybridization was also performed in a 

manner similar to Example 1. Fig. 8 show the resulting 
hybridization pattern of the array. 

Genes equally expressed in wild type and the 
transgenic Arabidopsis appeared yellow due to equal 

25 contributions of the green and red fluorescence to the 
final signal. The dots are different intensities of 
yellow indicating various levels of gene expression. 
The cDNA clone representing the transcription factor 
HAT4, expressed in the transgenic line of Arabidopsis 

30 but not detectably expressed in wild type Arai?idopsis, 
appears as a red dot (with the arrow pointing €o it) , 
indicating the preferential expression of the 
transcription factor in the red-labeled transgenic 
Arabidopsis and the relative lack of expression of the 
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transcription factor in the green-labeled wild type 
Arabidopsis . 

An advantage of the microarray hybridization 
format for gene expression studies is the high partial 
5 concentration of each cDNA species achievable in the 10 
microliter hybridization reaction. This high partial 
concentration allows for detection of rare transcripts 
without the need for PCR amplification of the 
hybridization probe which may bias the true genetic 
10 representation of each discrete cDNA species. 

Gene expression studies such as these can be used 
for genomics research to discover which genes are 

development states or environmental conditions. Gene 
15 expression studies can also be used for diagnosis of 
disease by empirically correlating gene expression 
patterns to disease states. 



Example 3 

Multiplex ed Colorimetric Hybridization on 
a Gridded Solid Support 

A sheet of plastic-backed nitrocellulose was 

gridded with barrier elements made from silicone rubber 

according to the description in Section IV-A. The 

sheet was soaked in 10 x SSC and allowed to dry. As 

shown in Fig. 12, 192 M13 clones each with a different 

yeast inserts were arrayed 4 00 microns apart in four 

quadrants of the solid support using the automated 

device described in Section III. The bottom left 

quadrant served as a negative control for hybridization 

while each of the other three quadrants was hybridized 

simultaneously with a different oligonucleotide using 

the open-face hybridization technology described in 

Section IV-A. The first two and last four elements of 
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each array are positive controls for the colorimetric 
detection step. 

The oligonucleotides were labeled with fluorescein 
which was detected using an anti-f luorescein antibody 
5 conjugated to alkaline phosphatase that precipitated an 
NBT/BCIP dye on the solid support (Amersham) * Perfect 
matches between the labeled oligos and the M13 clones 
resulted in dark spots visible to the naked eye and 
detected using an optical scanner (HP ScanJet II) 
10 attached to a personal computer. The hybridization 
patterns are different in every quadrant indicating 
that each oligo found several unique M13 clones from 

skvn^tMrv <-\ 1 Q O *.t n V* ^ *\AV>f a * a n /"« a "m Ua-^-a +»V*a^ 

the open capillary printing tip leaves detectable 
15 dimples on the nitrocellulose which can be used to 
automatically align and analyze the images. 



Although the invention has been described with 
respect to specific embodiments and methods, it will be 
20 clear that various changes and modification may be made 
without departing from the invention. 
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IT IS CLAIMED: 

1. A method of forming a microarray of analyte- 
assay regions on a solid support, where each region in 
5 the array has a known amount of a selected, analyte- 
specific reagent, said method comprising, 

(a) loading a solution of a selected analyte- 
specific reagent in a reagent-dispensing device having 
an elongate capillary channel (i) formed by spaced- 

10 apart, coextensive elongate members, (ii) adapted to 
hold a quantity of the reagent solution and (iii) 
having a tip region at which aqueous solution in the 
channel forms a meniscus, 

(b) tapping the tip of the dispensing device 

15 against a solid support at a defined position on the 
surface, with an impulse effective to break the 
meniscus in the capillary channel and deposit a 
selected volume of solution on the surface, and 

(c) repeating steps (a) and (b) until said array 
20 is formed. 

2. The method of claim 1, wherein said tapping is 
carried out with an impulse effective to deposit a 
selected volume in the volume range between 0.01 to 100 

25 nl. 

3. The method of claim 1, wherein said channel is 
formed by a pair of spaced-apart tapered elements. 

30 4. The method of claim 1, for forming a plurality 

of such arrays, wherein step (b) is applied to ^a 
selected position on each of a plurality of solid 
supports at each repeat cycle proceeding step (c) . 
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5. The method of claim 1, which further includes, 
after performing steps (a) and (b) at least one time, 
reloading the reagent-dispensing device with a new 
reagent solution by the steps of (i) dipping the 
5 capillary channel of the device in a wash solution, 
(ii) removing wash solution drawn into the capillary 
channel, and (iii) dipping the capillary channel into 
the new reagent solution. 

10 6. Automated apparatus for forming a microarray 

of analyte-assay regions on a plurality of solid 
supports, where each region in the array has a known 

»_ - ^ "I — l_ _ J — _ _ ^ ..A. -m-^^s-mi^A^t J-k r» V» 1 

dlllUU.HU UJL U bClCU UCU , aiiaxjf uco^cwxi. j.v *~u*jw«* *~ , ~— 

apparatus comprising 
15 (a) a holder for holding, at known positions, a 

plurality of planar supports, 

(b) a reagent dispensing device having an open 
capillary channel (i) formed by spaced-apart , 
coextensive elongate members (ii) adapted to hold a 

20 guantity of the reagent solution and (iii) having a tip 
region at which aqueous solution in the channel forms a 
meniscus, 

(c) positioning means for positioning the 
dispensing device at a selected array position with 

25 respect to a support in said holder, 

(d) dispensing means for moving the device into 
tapping engagement against a support with a selected 
impulse, when the device is positioned at a defined 
array position with respect to that support, with an 

30 impulse effective to break the meniscus of liquid in 

the capillary channel and deposit a selected volume of 
solution on the surface, and 

(e) control means for controlling said positioning 
and dispensing means. 
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7. The apparatus of claim 6, wherein said 
dispensing means is effective to move said dispensing 
device against a support with an impulse effective to 
deposit a selected volume in the volume range between 

5 0.01 to 100 nl. 

8. The apparatus of claim 6, wherein said channel 
is formed by a pair of spaced-apart tapered elements. 

0 9. The apparatus of claim 6, wherein the control 

means operates to (i) place the dispensing device at a 
loading station, (ii) move the capillary channel in the 
device into a selected reagent at the loading station, 
to load the dispensing device with the reagent, and 

5 (iii) dispense the reagent at a defined array position 
on each of the supports on said holder. 

10. The apparatus of claim 6, wherein the control 
device further operates, at the end of a dispensing 

D cycle, to wash the dispensing device by (i) placing the 
dispensing device at a washing station, (ii) moving the 
capillary channel in the device into a wash fluid, to 
load the dispensing device with the fluid, and (iii) 
remove the wash fluid prior to loading the dispensing 

5 device with a fresh selected reagent. 

11. The apparatus of claim 6, wherein said device 
is one of a plurality of such devices which are carried 
on the arm for dispensing different analyte assay 

) reagents at selected spaced array positions. 

12. A substrate with a surface having a 
microarray of at least 10 3 distinct polynucleotide or 
polypeptide biopolymers per 1 cm 2 surface area, each 
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distinct biopolymer sample (i) being disposed at a 
separate, defined position in said array, (ii) having a 
length of at least 50 subunits, and (iii) being present 
in a defined amount between about 0.1 femtomole and 100 
5 nanomoles. 

13 . The substrate of claim 12 , wherein said 
surface is glass slide coated with polylysine, and said 
biopolymers are polynucleotides. 

10 

14. The substrate of claim 12, wherein said 
substrate has a water-impermeable backing, a water- 

on the film, where said grid (i) is composed of 
15 intersecting water-impervious grid elements extending 

from said backing to positions raised above the surface 
of said film, and (ii) partitions the film into a 
plurality of water-impervious cells, where each cell 
contains such a biopolymer array. 

20 

15. A substrate with a surface array of sample- 
receiving cells , comprising 

a water-impermeable backing, 

a water-permeable film formed on the backing, and 
25 a grid formed on the film, said grid being composed of 
intersecting water-impervious grid elements extending 
from said backing to positions raised above the surface 
of said film. 

30 16. The substrate of claim 15, wherein the cells 

of the array each contain an array of biopolymers. 

17. A substrate for use in detecting binding of 
labeled biopolymers to one or more of a plurality 
35 distinct polynucleotides, comprising 
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a non-porous, glass substrate, 

a coating of a cat ionic polymer on said substrate, 

and 

an array of distinct polynucleotides to said 
5 coating, where each biopolymer is disposed at a 
separate, defined position in a surface array of 
biopolymers. 



18. A method of detecting differential expression 
10 of each of a plurality of genes in a first cell type 
with respect to expression of the same genes in a 
second cell types, said method comprising 

isolated from the two cells types, where the cDNA's 

15 from the first and second cells are labeled with first 
and second different fluorescent reporters, 

adding a mixture of the labeled cDNA's from the 
two cell types to an array of polynucleotides 
representing a plurality of known genes derived from 

20 the two cell types, under conditions that result in 

hybridization of the cDNA's to complementary-sequence 
polynucleotides in the array; and 

examining the array by fluorescence under 
fluorescence excitation conditions in which (i) 

25 polynucleotides in the array that are hybridized 

predominantly to cDNA's derived from one of the first 
and second cell types give a distinct first or second 
fluorescence emission color, respectively, and (ii) 
polynucleotides in the array that are hybridized to 

3 0 substantially equal numbers of cDNA's derived from the 
first and second cell types give a distinct combined 
fluorescence emission color, respectively, 

wherein the relative expression of known genes in 
the two cell types can be determined by the observed 

35 fluorescence emission color of each spot. 



WO 95/35505 



PCT/US95/07659 



44 

19. The method of claim 18 , wherein the array of 
polynucleotides is formed on a substrate with a surface 
having an array of at least 10 2 distinct polynucleotide 
or polypeptide biopolymers in a surface area of less 

5 than about 1 cm 2 , each distinct biopolymer (i) being 

disposed at a separate, defined position in said array, 
(ii) having a length of at least 50 subunits, and (iii) 
being present in a defined amount between about .1 
femtomole and 100 nmoles. 

0 

20. The method of claim 19, wherein said surface 
is a glass slide coated with poly lysine, and said 
biopolymers are polynucleotides non—cov3.1ently bound to 
said poly lysine. 



15 
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Fig. 3 
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TCTTGCTCATTTTGATATTGCTC- TGTAGATTG- 
TACTGAGAGTGCA03'; and 5'-GCTACAAACAGC- 
GTCGACTTGAATGCCCCGACATCTTCGACTGT- 
GCGGT ATT TCACACCG- 3 ' ) were used to arnprty 
tne URA3 sequence of pRS3i6, and the reaction 
product was transformed mo yeast tor one- step gene 
recjiacement |R Rothsten Methods Enzymol. 1W. 
281 (1991)]. To create tne ax/7 ^;L£U? mutatcn con- 
tained on pi 14, a S.Okb Sal I fragment from pAXLT 
was coned mo pUC19. and an interna* 4.0-W5 Hpa 
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6.2-kb Hnd U genorric fragment earned on 
pSP72 (Promega). To create YEpMWT, a 1.6-kfc 
Bam HI fragment contamg MFA1. from pKK16 [K. 
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30. A W303 1A dervatwe. SY2625 {MAT* ura3-7 Jau2-3, 
I12trpi-1 ade2-1 can1-l00sst1bmfa2k:FU$UtacZ 
hts3\::FUShH!S3l. was the parent strain for the mutant 
search. SY2625 denvatrves for the mating assays, se- 
rved pherornone assays, and the putee-chase exper- 
iments nduded the fottowmg strains' Y49 tsfe22-/), 
Y115 1mtaW:LEU2l. Y142 laxl1::URA3l. Y173 
iaxnX:LEUZ). Y220 iaxi1::URA3 Ste23k:URA3\. Y221 
iste23k:URA3), Y231 (ax/7 k:l£U2 ste23X\-LEU2). 
and Y233 iste23&::LBJ2). MAT a denvatrves of 
SY2625 included the foliowtng strains: Y199 
(SY2625 made MA 7a). Y278 (sfe22-7). Y195 
[mfal k:LEU2). Y196 (ax/7 £l:LBJZ}, and Y197 
{ax/l.. UftA3).TheEGl23(MATateu2ura3frp7 can J 
t)is4) genetic background was used to create a set of 
strains for analysis of bud site selection. EG 123 de- 
rivatives included the toHowmg strains: Y175 
(ax/7A;.t£L/2), Y223 iaxJ1::URA3). Y234 (sfe23^: 
LEU2). and Y272 1&d1±:LEU2 ste23k:LBJZ\. 
MATa denvatrves of EG123 mduded the following 
strains: Y214 {EG123 made MATa) and Y293 
{axil b::LEU2). Ait strains were generated by means 
of standard genetc or molecular methods mvotong 
tne appropriate constructs (23). In particular, the axil 
ste23 double mutant strains were created by cross- 
ing of the appropriate MAT* ste23 and MATa axil 
rrtutants, followed by sponiation of the resuttant dtp- 

and tsoiatfor- of the double mutant from nonpa- 
a «vr» L H"a:;:; ■j'vurK* i i - 

a. izaqotofl, Yeast 2, 163 (1986)} piasmc cor- 
'.vr.~yg 3 c , 5 Kt Szl - tr^gryyV, of pA*2. 7. pIS 1 ! w^s 
oenved trom p1 29 by nseruon of a knker at the BgM! 
srte wrthn AXL 7 , which ted to an rv frame nserbon of 
the hemaggtotjntn (HA) eortope (DQYPYDvPDYA) t29) 
between amro aads 854 and 855 of the AXL 7 prod- 



uct. pC225 is a KS+ (Stratagene) ptasmd coniarTng 
a 05-kb Bam H+-^Sst I trag/nerrt from pAXl 7 . Sutastf 
tutiori mutations of the proposed act*e site of AxJi p 
were created with the use of pC225 and srte-speafc 
mutagenesis ns^nng appropriate synthetc otogonu- 
cteowes (ajrf7-H68A. 5 ' -GTGCTCACAAAGCGCT- 
GCCAAACCGGC-3'; ax}U67lA. 5'-AAGAATCAT- 
GTGCGCACAAAGGTGCGC-3'; and axll-E7iD. 5'- 
AAGAATCATGTGATCACAAAGGTGCGC-3'). The 
mutatons were confirmed by sequence anafyss. Af- 
ter rrwtageneas. the 0.4-kto Bam Hl-Msc I fragment 
from the mutageneed pC225 p4asmids was trans- 
terred nto pAXL 7 to create a set c^pH^i6piasmcis 
cam/ng dfflerent AXI7 atotes. p124 iajdl-H6&A), 
P130 iaxt1-E7iA). and pi32 Iaxr7-£7;D). Smiarty, a 
set of HA-tagged adetes earned on YEp352 were cre- 
ated after reptacement of the pi 51 Bam HWrtsc I 
fragment, to generate pi 61 iajrfI-£77A), p162 iaxJU 



The temporal, developmental, topographi- 
cal, histological, and physiological patterns 
in which a gene is expressed provide clues to 
its biological role. The large and expanding 
database of complementary DNA (cDNA) 
sequences from many organisms ( I ) presents 
the opportunity of defining these patterns at 
the level of the whole genome. 

For these studies, we used the small flow- 
ering plant Arabidopsis thalkuvi as a model 
organism. Arabidopsis possesses many ad- 
vantages for gene expression analysis, in- 
cluding the fact that it has the smallest 
genome of any higher eukaryote examined 
to date (2). Forty-five cloned Arabidopsis 
cDNAs (Table 1), including 14 complete 
sequences and 31 expressed sequence tags 
(ESTs), were used as gene-specific targets. 
We obtained the ESTs by selecting cDNA 
clones at random from an Arabidopsis 
cDNA library. Sequence analysis revealed 
that 28 of the 31 ESTs matched sequences 
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in the database (Table 1 ). Three additional 
cDNAs from other organisms served as con- 
trols in the experiments. 

The 48 cDNAs, averaging -1.0 kb, 
were amplified with the polymerase chain 
reaction (PCR) and deposited into indi- 
vidual wells of a 96-well microliter plate. 
Each sample was duplicated in two adja- 
cent wells to allow the reproducibility of 
the arraying and hybridization process to 
be tested. Samples from the microtiter 
plate were printed onto glass microscope 
slides in an area measuring 3.5 mm by 5.5 
mm with the use of a high-speed arraying 
machine (3). The arrays were processed by 
chemical and heat treatment to attach the 
DNA sequences to the glass surface and 
denature them (3). Three arrays, printed 
in a single lot, were used for the experi- 
ments here. A single microtiter plate of 
PCR products provides sufficient material 
to print at least 500 arrays. 

Fluorescent probes were prepared from 
total Arabidopsis mRNA (4) by a single 
round of reverse transcription (5). The Ato- 
T:P^' tj -.v v;prVtemented with hn 
a^ervtcnoiine receptor ^aCKR; mKN r 
... a Juucioii oi i i^.MJ ^w/w) betore cDNA 
v"; t_h; es V- t ■ provide 1 n terns 1 standard for 
calibration (5). The resulting fluorescently 
labeled cDNA mixture was hybridized to an 
array at high stringency (6) and scanned 



Quantitative Monitoring of Gene Expression 
Patterns with aJCompIementary DNA Microarray 

Mark Schena,* Dari Shalon,*t Ronald W. Davis, 
Patrick O. Brownt 

A high-capacity system was developed to monitor the expression of many genes in 
parallel. Microarrays prepared by high-speed robotic printing of complementary DNAs on 
glass were used for quantitative expression measurements of the corresponding genes. 
Because of the small format and high density of the arrays, hybridization volumes of 2 
microliters could be used that enabled detection of rare transcripts in probe mixtures 
derived from 2 micrograms of total cellular messenger RIMA. Differential expression 
measurements of 45 Arabidopsis genes were made by means of simultaneous, two-color 
fluorescence hybridization. 



with a laser (3). A high-sensitivity scan gave 
signals that saturated the detector at nearly 
all of the Arabidopsis target sites (Fig. 1A). 
Calibration relative to the AChR mRNA 
standard (Fig. 1A) established a sensitivity 
limit of - 1 : 50,000. No detectable hybridiza- 
tion was observed to either the rat glucocor- 
ticoid receptor (Fig. 1 A) or the yeast TRP4 
(Fig. 1A) targets even at the highest scan- 
ning sensitivity. A moderate -sensitivity scan 



of the same array allowed linear detection of 
the more abundant transcripts (Fig. IB). 
Quantitation of both scans revealed a range 
of expression levels spanning three orders of 
magnitude for the 45 genes tested (Table 2). 
RNA blots (7) for several genes (Fig. 2) 
corroborated the expression leveb measured 
with the microarray to within a factor of 5 
(Table 2). 

Differential gene expression was mvesti- 




Fig. 1. Gene expression monrtored with the use of cDNA microarrays. Fluorescent scans represented in 
pseudocolor correspond to hybridization intensities. Color bars were calibrated from the signal obtained 
with the use of known concentrations of human AChR mRNA in independent expenments. Numbers and 
letters on the axes mark the position of each cDNA. (A) High-sensrtivJty fluorescein scan after hybndizatton 
with fiuorescein-iabeied cDNA derived from wild-type plants. (B) Same array as in (A) but scanned at 
^ooerate sensitrvity (C and D) A single array was probed with a i i mixture of fluorescein- labeled ^DN^ 



resoo 



TDinc; to mMh4/* r r - .v;,,: ■ ¥ -: ... 
* : v I'ssafmr^ nuorescenr^ nrresoonomg f v ^hna r-or^ 1 4 -transgenic tia-Ub .... . L 
■'■■1 F\ A stnqie array was Drooed wrth all mixture ot fluorescein- labe+ed cDNA from root *.\^ue> anri 
,;ssamine-iabe4ed cDNA from leaf tissue. The single array was then scanned successively to detect the 
fluorescein fluorescence corresponding to mR NAs expressed m roots (E) and the Kssamine fluorescence 
corresponding to mRNAs expressed in leaves (F). 



gated with a simultaneous, rwo-color hv- 
bridization scheme, which served to mini- 
mize experimental variation inherent in the 
comparison of independent hybridizations. 
Fluorescent probes were prepared from two 
mRNA sources with the use of reverse tran- 
scriptase in die presence of fluorescein- and 
lissamine-labeled nucleotide analogs, re- 
spectively (5). The two probes were then 
mixed together in equal proportions, hy- 
bridized to a single array, and scanned sep- 
arately for fluorescein and lissamme emis- 
sion after independent excitation of the two 
fluorophores (3). 

To test whether overexpression of a sin- 
gle gene could be detected in a pool of total 
Arabidopsis mRNA, we used a microarray to 
analyze a transgenic line overexpressing the 
single transcription factor RATA (8). Fluo- 
rescent probes representing mRNA from 
wild- type and HAT4- transgenic plants were 
labeled with fluorescein and lissamme, re- 
spectively; the two probes were then mixed 
and hybridized to a single array. An intense 
hybridization signal was observed at the 
position of the HAT4 cDNA in the lissa- 
mine-specific scan (Fig. ID), but not in the 
fluorescein-spccific scan of the same array 
(Fig. 1C). Calibration with AChR mRNA 
added to the fluorescein and lissamine 
cDNA synthesis reactions at dilutions of 
1:10,000 (Fig. 1C) and 1:100 (Fig. ID), 
respectively, revealed a 50-fold elevation of 
HAT A mRNA in the transgenic line rela- 
tive to its abundance in wild-type plants 
(Table 2). This magnitude of HAT4 over- 
expression matched that inferred from the 
Northern (RNA) analysis within a factor of 
2 (Fig. 2 and Table 2). Expression of all the 
other genes monitored on the array differed 
by less than a factor of 5 between HAT4- 
transgenic and wild-type plants (Fig 1, C 



Wild type HAT4 transgenic 
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ROC1 




Human 
AChR 
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plants were sported onto nylon membranes and 
probed with the cDNAs indicated. Purified human 
AChR mRNA was used for calibration. 
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and D, and Table 2). Hybridization of flu- 
oresce in- labeled glucocorticoid receptor 
cDNA (Fig. 1C) and lissamine-labeled 
TRP4 cDNA (Fig. ID) verified the pres- 
ence of the negative control targets and the 
lack of optical cross talk between the two 
fluorophores. 

To explore a more complex alteration in 
expression patterns, we performed a second 
rwo-color hybridization experiment with 
fluorescein- and lissamine-labeled probes 
prepared from root and leaf mRNA, respec- 
tively. The scanning sensitivities for the 
two fluorophores were normalued by 
matching the signab resulting from AChR 



'Proprietary sequence of Stratagene (La Jotta, CaWomta). 



mRNA, which was added to both cDNA 
synthesis reactions at a dilution of 1:1000 
(Fig. 1 , E and F). A comparison of the scans 
revealed widespread differences in gene ex- 
pression between root and leaf tissue (Fig. 1, 
E and F). The mRNA from the light-regu- 
lated CAB/ gene was —500-fold more abun- 
dant in leaf (Fig. IF) than in root tissue 
(Fig. IE). The expression of 26 other genes 
differed between root and leaf tissue by 
more than a factor of 5 (Fig. 1, E and F). 

The HAT4-transgenic line we examined 
has elongated hypocotyls, early flowering, 
poor germination, and altered pigmentation 
(8). Although changes in expression were 



tNo match ri tne database; novel EST, 



observed for HAT4, large changes in ex- 
pression were not observed for anv of the 
other 44 genes we examined. "This wa> 
somewhat surprising, particularly because 
comparative analysis of leaf and root tissue 
identified 27 differentially expressed genes. 
Analysis of an expanded set of genes mav be 
required to identify genes whose expression 
changes upon HAT4 overexpression; alter- 
natively, a comparison of mRNA popula- 
tions from specific tissues of wild-type and 
HAT4 -transgenic plants may allow identi- 
fication of downstream genes. 

At the current density of robotic pnnting, 
it is feasible to scale up the fabrication pro- 
cess to produce arrays containing 20,000 
cDNA targets. At this density, a single array 
would be sufficient to provide gene-specific 
targets encompassing nearly the entire rep- 
ertoire of expressed genes in the Arabidopsis 
genome (2). The availabiliry of 20,274 ESTs 
from Arabidopsis (1,9) would provide a rich 
source of templates for such studies. 

The estimated 100,000 genes in the hu- 
man genome (10) exceeds the number of 
Arabidopsis genes by a factor of 5 (2). This 
modest increase in complexity suggests that 
similar cDNA microarrays, prepared from 
the rapidly growing repertoire of human 
ESTs (1), could be used to determine the 
expression patterns of tens of thousands of 
human genes in diverse cell types. Coupling 
an amplification strategy to the reverse 
transcription reaction (11) could make it 
feasible to monitor expression even in 
minute tissue samples. A wide variety of 
acute and chronic physiological and patho- 
logical conditions might lead to character- 
istic changes in the patterns of gene expres- 
sion in peripheral blood cells or other easily 
sampled tissues. In concert with cDNA mi- 
croarrays for monitoring complex expres- 
sion patterns, these tissues might therefore 
serve as sensitive in vivo sensors for clinical 
diagnosis. Microarrays of cDNAs could thus 
provide a useful link between human gene 
sequences and clinical medicine. 

Table 2. Gene expression rronitonng by rnicroar- 
ray and RNA blot analyses; tg. HAT4 -transgenic. 
See Table 1 tor additional gene information. Ex- 
pression levels (w/w) were calibrated with the use 
of known amounts of human AChR mRNA. Values 
for the microarray were deterrruned from microar- 
ray scans (Fig. 1); values for the RNA blot were 
deterrntr>ed from RNA blots (Rg. 2). 



Gene 


Expression level (w/w) 

Microarray RNA biof 


>tA74 
rtAt'4 (tg/ 
ROC1 
ROC1 (tg) 


1-8300 
1:1 bO 
1:1200 
1:260 


150 
1:6300 
1:210 
1:1800 
1:1300 



Table 1. Sequences contained on the cDNA microarray. Shown is the position, the known or putatrve 
function, and the accession number of each cDNA in the microarray {Fig. 1 ). All but three of the ESTs used 
in this study matched a sequence in the database. NADH, reduced form of nicotinamide adenine 
dmucieotide: ATPase, adenosine triphosphatase; GTP, guano sine triphosphate. 





cDNA 


Function 


Accession 
number 


a1 : 2 


AChR 


Human AChR 




a3, 4 


EST3 


Actin 


H36236 


a5,6 


EST6 


NADH dehydrogenase 


Z27010 


a7, 8 


AAC1 


Actin 1 


M20016 


a9, 10 


EST12 


Unknown 


U36594T 


a11. 12 


EST 13 


Actin 


T45783 


b1, 2 


CAS/ 


Chlorophyll a/b binding 


M85150 


b3, 4 


EST17 


Phosphogtycerate kinase 


T44490 


b5, 6 


GA4 


GibbereUic acid biosynthesis 


L37126 


b7, 8 


EST19 


Unknown 


U36595T 


b9, 10 


GBF-1 


G-box binding factor 1 


X63894 


b1 1, 12 


EST23 


Bongation factor 


X52256 


CI , £ 




Aldolase 


T04477 


C3, 4 




o-dox otnoing tact or z 


X63895 


co, b 


to (o4 


Chioroplast protease 


R87034 


r7 ft 


CO 1 v>J 


\J\ I W WW 1 


Tl A i CO 




CO I H I 




I cjLI Aj 


CIV 12 


rGR 


Rat glucocorticoid receptor 


M14053 


d1, 2 


EST42 


Unknown 


U36596I 


d3, 4 


EST45 


ATPase 


J04185 


d5. 6 


HAT1 


Homeobox-teucine zipper 1 


U09332 


d7, 8 


EST46 


Light harvesting complex 


T04063 


d9. 10 


EST49 


Unknown 


T 76267 


dll. 12 


HA 72 


Homeobox-leucine zipper 2 


U09335 


e1.2 


HAT 4 


Homeobox-teucine zipper 4 


M90394 


e3. 4 


EST50 


Phosphoribulokinase 


T04344 


e5, 6 


HA 73 


Homeobox-leucine zipper 5 


M90416 


e7, 8 


EST51 


Unknown 


Z33675 


e9. 10 


HA 722 


Homeobox-leucine zipper 22 


U09336 


el 1, 12 


EST52 


Oxygen evolving 


T21749 


f1. 2 


EST59 


Unknown 


Z34607 


f3. 4 


KNAT1 


KnorfeoMike homeobox 1 


U14174 


15. 6 


EST60 


RuBtsCO small subunrt 


X14564 


f7, 8 


EST69 


Translation elongation factor 


T42799 


f9, 10 


PPH1 


Protein phosphatase 1 


U34803 


f11. 12 


EST 70 


Unknown 


T44621 


91.2 


EST 75 


Chtoroplast protease 


T43698 


g3.4 


EST 7 8 


Unknown 


R65481 


g5, 6 


ROC1 


Cyclophilin 


L14844 


g7. 8 


EST82 


GTP binding 


X59152 


g9. io 


EST83 


Unknown 


Z33795 




FSTB4 


! In known 


T45278 
















->481r 








^904 i r 




-3T10G 


^jgnt harvesting comptex 


Z 18205 






ught harvesting complex 


X03909 


hi 1, 12 


TRP4 


Yeast tryptophan biosynthesis 


X04273 
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Gene Therapy in Peripheral Blood 
Lymphocytes and Bone Marrow for 
ADA Immunodeficient Patients 

Claudio Bordignon,* Luigi D. Notarangelo, Nadia Nobili, 
Giuliana Ferrari, Giulia Casorati, Paola Panina, Evelina Mazzolari, 
Daniela Maggioni, Claudia Rossi, Paolo Servida, 
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Adenosine deaminase (ADA) deficiency results in severe combined immunodeficiency, 
the first genetic disorder treated by gene therapy. Two different retroviral vectors were 
used to transfer ex vivo the human ADA minigene into bone marrow cells and peripheral 
blood lymphocytes from two patients undergoing exogenous enzyme replacement ther- 
apy. After 2 years of treatment, long-term survival of T and B lymphocytes, marrow cells, 
and granulocytes expressing the transferred ADA gene was demonstrated and resulted 
in normalization of the immune repertoire and restoration of cellular and humoral immunity. 
After discontinuation of treatment, T lymphocytes, derived from transduced peripheral 
blood lymphocytes, were progressively replaced by marrow-derived T cells in both pa- 
tients. These results indicate successful gene transfer into long-lasting progenitor cells, 
producing a functional multilineage progeny. 



Oevcre combined immunodeficiency asso- 
ciated with inherited deficiency of ADA 

( 1 ) is usually fatal unless affected children 
are kept in protective isolation or the im- 
mune system is reconstituted by bone mar- 
row transplantation from a human leuko- 
cyte antigen (HLA)-identical sibling donor 

(2) . This is the therapy of choice, although 
it is available only for a minority of patients. 
In recent years, other forms of therapy have 
been developed, including transplants from 
haploidentical donors (3, 4) t exogenous en- 
zyme replacement (5), and somatic-cell 
gene therapy (6-9). 

We previously reported a preclinical mod- 
el in which ADA gene transfer and expression 



C. Bordignon. N Nob*. G. Ferran. D. Maggont. C. Rossi. 
P Servida. F. Mavibo. Telethon Gene Therapy Program 
tor Genetx: Diseases. DIBIT, istrti/to Soentrfco K s. Raf- 
"v^ Mdar H aN 

~:-n: o: ^eaia*^.;.-, « uvr-^ ■ ■- -v^suid v*»aica: ..>■: ->■■■=■ 
"ftscia rtar, 

- Casoraa, unna o NTYTxrocrwnica, DiBH, istrtuio 
er.otco r-i. S Rariaete. M»an r aar/. 
P. Panina, Roche MOano Rcerche, Mdan. rtary. 

* To wnom conespondence shouid be addressed. 



successfully restored immune functions in hu- 
man ADA-deficient (ADA") peripheral 
blood lymphocytes (PBLs) in immunodefi- 
cient mice in vivo (JO, ]]). On the basis of 
these preclinical results, the clinical applica- 
tion of gene therapy for the treatment of 
ADA" SCID (severe combined immunodefi- 
ciency disease) patients who previously failed 
exogenous enzyme replacement therapy was 
approved by our Institutional Ethical Com- 
mittees and by the Italian National Commit- 
tee for Bioethics (J 2). In addition to evaluat- 
ing the safety and efficacy of the gene therapy 
procedure, the aim of the study was to define 
the relative role of PBLs and hematopoietic 
stem cells in the long-term reconstitution of 
immune functions after retroviral vector-me- 
diated ADA gene transfer. For this purpose, 
two structurally identical vectors expressing 
^- h-rr^n A OA complementary DNA 

r -*nnT «•■.■- 'T^rtrtim Mre^ in a nonfunctional 
^ion v -t cne virai iong- terminal repeal 
-T7\), were used to transduce PBLs and bone 
marrow (BM) cells independently. This pro- 
cedure allowed identification of the origin of 



SCIENCE • VOL 270 • 70 OTTORFR TQQS 




"-•'•as*.- 



1 1/ / uu/uo 




editOriCll To affinity... and beyond! fcUT 



T]eWS & vieWS Who's afraid ofepistasis? 

Wayne N Frankc! & Nicholas J Schork 



Meiotic nondisjunction does the two-step 
Terry Orr- Weaver | n &y\ 

Flood warning — resistance genes unleashed 
Richard Michelmore 



Nature Genetics 
Editor 

Kevin Davies 

Assistant Editors 
Laune Goodman 
Bene Phimister 

Production Editor 

J. Stuart Griffith 

Assistant Production Editor 
Ken Krattenmakef 

Editorial Assistant 
Janelie Bolden 



CO TVeSp 0 Yl detl C€ Toward a unified genetic map of higher plants, transcending 
* the monocot-dicot divergence 

A H Paterjon, T-H Lan, K P Reischmann. C Chang, Y-R Lin, S-C Liu> M D 
Burow, S P Kowalski, C S Katsar, T A DelMonte, K A Feldmann, K F 
Schertz & ] F Wendel 

Non-canonical introns are at least 10 9 years old 

H-| Wu, P Gaubier-Comella, M Delseny, F Grellet, M Van Montagu & 
PRouze 

Vat92Met variant of the melanocyte stimulating hormone 
receptor gene 

X Xu, M ThornwalK L-G Lundin & V Chhajlani 



pTOQTeSS Genes responsible for human hereditary deafness: 
r ® symphony of a thousand 

Christine Petit 



367 
371 
374 
376 

380 



383 



384 



385 



Washington Bureau Chief 

Barbara J. Culhton 



Editorial Office 

545 National Press Building 

Washington DC 20045 

Tel: (202)626-2513 

Fax: (202) 626-0970 

email: natgenOnaturedc.com 

WWW: genetics.nature.com 




articles 

Cloning and characterization of a novel bicoid- related 392 
homeobox transcription factor gene, RIEG, involved in Rieger 
syndrome 

E V Semina, R Reiter, N J Leysens, W L M AJward, K W Small, 

N A Datson, J Siegel-Bartelt, D Bierke- Nelson, P Bitoun, B U Zabel, 

J C Carey & J C Murray 

Susceptible chiasmate configurations of chromosome 21 400 
predispose to non-disjunction in both maternal meiosis I and 
meiosis II 

N E Lamb, S B Freeman, A Savage-Austin, D Pettay, Lisa Taft, ] Hersey, 
Y Gu, I Shen, D Saker, K M May, D Avramopoulos. M B Petersen, 
A Hallberg, M Mikkelsen, T I Hassold & S L Sherman \~^^} 

Spontaneous X chromosome Ml and Mil nondisjunction events 406 
in Drosophiia melanogaster oocytes have different 
recombinational histories 

K E Koehler, C L Boulton, H E Collins, R L French, K C Herman, 
S M Lacefield, L D Madden, C D Schuctz 6c R S Hawley |^g^| 

Suppression of the novel growth inhibitor p33 ING promotes 41 5 
neoplastic transformation 

I Garkavisev, A Kazarov, A Gudkov & K Riabowol 1 



Cover art: Ken Krattenmaker 



Nature Genetics (ISSN 1061-4036) * published monthly by Nature PuWishtng Co . headquartered at 345 Park Avenue South. New Yonv NY 10010, which ts owned by Nature America 
inc.. a subsidiary ol Macmittan Magazmes Ltd, of London. EdrtooaJ Office Nature Genetcs. 545 Natonai Press BmWmg. Washington DC 20045 Telephone (202) 626 2513. Fax (202) 
626 0970 e~maii naigenOnjturwdc com North Am#n«n Advertising Nature Genets 345 Par* Avenue Soum New YryV NY *0OVj Tr*+*pJVioe Q:?: ??£ 9?0C f2"2) 696 9606 
■ n. M in *d ( tft»i.pg ^i^r, rf .,. ' - i W, W^* fiY< - .* - • • ♦* ..b»cnption» r #n . w »l* 

*"-0»t '■■■* »CtO'"»»* b*C* H*Wfl *"C m:i LUIiOmii ItOOCI OuettKJCU iTkLXik; l * * JOT* sec "." ^»lur» L>«r*«tw:S ouIjiC. f :DlH> , -f»L. ■ r *^ m„^i.,_,* ^rmfir *Df\ H v 

• eiepnone u 1 ' 2 "?6 9? 00 r *» t'2'2) 696 9606 Ouiikj* North Am»nci Nature oenetcs. Macrruttan Magazines -onw ->ou-. - njr sfre+: , ^-y. si axw 
-■f v*K>nr aii ' is R3 \ 4000 (u 1 " S4J 4S96 Annuel subsenpbon rates uVOanaaa. USS495 (rtstjtuuonat/corpcxaie). uSJi^b (indivtouai maKing personal payment), Canada 
h ;j tor GST. BN 14091 1 595 RT UK C350 (nstrtLrtionat/corporatel fl 75 dndtvKKia 1 making personai payment) Japan Contart Ma* ,,r» tapan ¥ v. s^-" 1 **'.-** 9V?" 4 C 3 5 
r "yaya iamachi. bhm| U *u-Ku. To*iyo l b2 Rtpnnts Nature Genetics Reonryj Deo* ?J5 p ani a»*ni>e c ^ ► '■ - - - _ ^ ^ r c r.od<cai» 

_o . - - Se-<c oaaress charges to N-;u't Gertt^s Subscnpton Dept. 345 Pan\ Avenue South. New Yom. NY 10010. Eiecui*ve Offers ot Nature Amenca. tnc : Ncholas Byam 
bnaw Cnairman ol tr>* Board: Mary Waftham. PresKjent. Edward Vales. Secretary -Treasure* Published m Japan by Nature Japan KK, Shm-Mitsuke Bmiding. 3F. 3-6 Ichigaya TamaoY 
Shmjuka-ku Tokyo 162. Japan Printed in the USA pv Cadmus Journal Services C1996 N3tur* Ame-tra 



Njture Publishing Co. 
■ 345 Park Avenue South 
10th floor 

New Yorfc, NY 10010-1707 
Tel: (212)726-9200 
Fax: (212) 696-9606 

President-Publisher 

Mary Wattham 

Vice President Sales 

Manon Delaney 

Vice President Marketing 

James A. Skowrenski 

American Advertising Sales 
Manager 

Sande T. Giaccone {New York) 

European Advertising Sales 
Manager 

Kathryn Wayman (London) 

Classified Advertising Sales 
Manager 

"rika A. Simon (New York) 
iike Grant (London) 

Assistant Classified Sales 
Manager 

Benjamin Crowe (New York) 

Production & Information 
Systems Director 

Nick Kemp 

Circulation Manager 

Moira Musto (New York) 
Nic Harman (London) 

-roup Marketing Manager 

<na Dzurenda 



Macmillan Magazines Ltd 

Porters South 

Crinan Street 

London N1 9XW 

Tel: 44 (0)171 833 4000 

"=»x: 44 (0)171 643 4596 

anaging Director 

'-ay Barker 

Publishing Director 
Andy Sutherland 

Editor-in-Chief, 
Nature publications 

Philip Campbell 

Art Director 
Jane Walker 



Nature Japan KK 
Shin-Mitsuke Bldg 
3-6 Ichigaya Tamachi 
Shinjuku-ku 
Tokyo 162 

Telephone 03 3267 8751 



articles ap cr -based approach for isolating pathogen resistance 

genes from potato with potential for wide application in plants 
D Letsicr. A Ballvora, F Salamini & C Gebhardt ggf] 

Identification of a RING protein that can interact in vivo with 
the BRCA1 gene product 

L C Wu, 2 W Wang, I T Tsan, M A Spillman, A Phung, X L Xu, 
M-C W Yang, L-Y Hwang, A M Bowcock & R Baer 

Detection of heterozygous mutations in BRCA1 using high 
density oligonucleotide arrays and two-colour fluorescence 
analysis 

I G Hacia, L C Brody, M S Chee, SPA Fodor & F S Collins iT^T 

Quantitative phenotypic analysis of yeast deletion mutants 
using a highly parallel molecular bar-coding strategy 

D D Shoemaker, D A Lashkari, D Morris, M Mittmann & R \V Davis \7dTT 



letters 



Use of a cDNA microarray to analyse gene expression patterns 

in human cancer . ^ . ... 

I DeRisi, L Peniand & P O Brown (Group l);ML Burner. P S Meitzer, 
M Ray, Y Chen, Y A Su & J M Trent (Group 2) \~^T\ 

Retinal-specific guanylate cyclase gene mutations in Leber's 
congenital amaurosis 

I Perrault. 1 M Ro7pt. P r^lvac $>C+rh*r a r-» m , u n«nf,., c tllj:. 

' - • . 4 l^uiuuj, j V^IIBlt II 1 1, 

E Souied, I Ghazi, C Leowski, M Bonnemaison, D Le Pasiier, J Frezal, 
i-L Dufier, S Pittler» A Munnich fie J Kaplan 

Complex interactions of new quantitative trait loci, Siud , 
Sluc2 t Sluc3, and S/uo4, that influence the susceptibility to lung 
cancer in the mouse 

R J A Fijneman, S S de Vries, R C Jansen & P Demant 



421 



430 



441 



4S0 



457 



461 



Gene interaction and single gene effects in colon tumour 
susceptibility in mice 

T van Wez el, A P M Stassen, C J A Moen, A A M Hart, M A van der Valk & 
P Demant 



A major quantitative trait locus influences hyperactivity in the 
WKHA rat 

M-P Moisan, H Courvoisier, M-T Bihoreau, D Gauguier, E D Hendley, 
M Lathrop, M R James & P Mormede 

An H-YD b epitope is encoded by a novel mouse Y chromosome 
gene 

A Greenfield, D Scotl, D Pennisi, I Ehrmann, P Ellis, L Cooper, 
E Simpson fic P Koopman 

Homozygosity mapping of Hallervorden-Spatz syndrome to 
chromosome 20p12.3-p13 

T L) Taylor, M Litt, P Kramer. M Pandolfo, L Angelini, N Nardocci, S Davis, 
M Pineda. H Hattori, P J Flett, M R Cilio, E Bertini & S J Huyflick 

Identification of BTG2 t an antiproliferative p53-dependent 
component of the DNA damage cellular response pathway 

J-P Kouault, N Falette, F Guehenneux, C Guilloi, R Rimokh, Q Wang, 
C Berthet, C Moyret-Lnlle, P Savatier. B Pain, P Shaw, R Berger, 
I Samarui, J-P Magaud, M Ozturk, C Samarut & A Puiiieux 



465 



468 



471 



474 



479 



482 



correction/errata See pages 487-488 



letters 



'Howard Hughes 
Mcdn nl Institute, 
''Department of 
llioeliemiitry 
Stanford University 
Main til Center, 
Stiintoril, California 

USA 
1 Laboratory of 
Cottar Genet Us, 
Sat tonal Center jor 
Human Genome 
Re>carch. National 
ln>ttitiie> of Health, 
BcthexUi, Marylaml 
20S92. USA 

7 P. c-M.L B. 



Use of a cDNA microarray to 
analyse gene expression 
patterns in human cancer 

Joseph DeRisi 1 *, Lolita Penland 2 8c 
Patrick O. Brown 2 (Group 1); 
Michael L Bittner 3 *, Paul S. Meltzer\ 
Michael Ray 3 , Yidong Chen 3 , Yan A. Su 3 & 
Jeffrey M. Trent 1 (Group 2) 

The development and progression of cancer 1 * 3 and 
the experimental reversal of tumorigenicity 4,5 are 
accompanied by complex changes in patterns of 
gene expression. Microarrays of cDNA provide a 
powerful tool for studying these complex phenom- 
ena 6 " 8 . The tumorigenic properties of a human 
melanoma cell line, UACC-903, can be suppressed 
by introduction of a normal human chromosome 6, 
resulting in a reduction of growth rate, restoration 
of contact inhibition, and suppression of both soft 
agar clonogenicity and tumorigenicity in nude 
mice 4,5,9 . We used a high density microarray of 
1,161 DNA elements to search for differences in 
gene expression associated with tumour suppres- 
sion in this system. Fluorescent probes for 
hybridization were derived from two sources of cel- 
lular mRNA [UACC-903 and UACC-903(+6)] which 
were labelled with different fluors to provide a direct 
and internally controlled comparison of the mRNA 
levels corresponding to each arrayed gene. The flu- 
orescence signals representing hybridization to 
each arrayed gene were analysed to determine the 
relative abundance in the two samples of mRNAs 
corresponding to each gene. Previously unrecog- 
nized alterations in the expression of specific genes 
provide leads for further investigation of the genet- 
ic basis of the tumorigenic phenotype of these cells. 

DNA microarrays. containing KIM total elements, 
including S70 different cDNAsand controls*' 1 1 (see 
Methods), were printed robotically onto a glass micro- 
scope slide in tour quadrants covering an area of about 
I enr (Fig. I ). We prepared fluorescent cDNA probes 
using total poly (A)* mRNA from UACC-903 cells and 
UACC-V03{+6) cells by labelling with a green and red 
fluor, respectively. A mixture of the two flourescently 
labelled probes was hybridized to the DNA microarray. 
This comparative hybridization method, coupled with 
the doping of synthetic standards and an estimation of 
statistically significant deviation for local background 
variance allowed a direct and quantitative comparison 
of the relative abundance of individual DNA sequences 
in this complex simple'"*. We added a set ol synthetic 
poly (A)*-tailed 'mRNAs' U> the purified mRNA from 



spond to genes preferentially expressed in the tumori- 
genic UACC-903 cell line, and the reddish spots corre- 
spond to genes preferentially expressed in the 
non-tumongenic UACC-903(+6) cell line. Genes 
expressed at approximately equal levels in the two cell 
lines appear yellow or brown. A portion of the array at 
higher magnification highlights the diverse pattern of 
differential expression observed ( Fig. 26). In Fig. 2c, rec- 
tangles corresponding to specific array elements are 
coloured to reproduce the hue and intensity of the fluo- 
rescent signal at each element. The hybridization signals 
from a duplicated set of genes are shown juxtaposed, to 
illustrate the reproducibility of the hybridization signals 
for each gene. 

To address the possibility that an apparent difference 
in expression might result from experimental variables 
unrelated to the difference in chromosomal composi- 
tion between the two cell lines, we examined the vari- 
ance in expression for 90 'housekeeping' genes. We 
selected these genes based on the assumption that they 
would not be differentially expressed between the two 
cell lines. The averaged red/green ratio for this subset of 

uenes was 1.13. The averaped red/i*reen ratio for th^ 

^ t , v _ ... _ — 

of five internal standards was 0.97 (/; = 10). The vari- 
ability in the expression level of the housekeeping genes 
probably overestimates the experimental variability in 
measuring differential expression. As a conservative stan- 
dard, an absolute fluorescent signal (red or green) with 
an intensity greater than that observed at the control 
array elements containing total human genomic DNA 
was considered to represent specific hybridization. Gene- 
specific hybridization was therefore only considered sig- 
nificantly different between samples if the following two 
criteria were met: i) the signal intensity (green or red) 
exceeded this threshold; and ii) the logarithm of the 
red /green fluorescence signal ratio differed by >3 S.D. 
from the mean logarithm of this ratio for the 'house- 
keeping gene panel (that is, ratios <0.52 or >2.4). 

By these criteria. mRNA levels for 1 5/870 (1.7%) genes 
were significantly diminished, while the mRNA levels 
tor 63/870 (7.3%) genes were significantly increased in 
association with suppression ol tumorigenicity by intro- 
duction of chromosome 6. To test the reliability of 
microarray hybridization results in identifying differen- 
tially expressed genes, we analysed 16 genes by north- 
ern analysis. In each case, the results of northern analysis 
corroborated the differential gene expression identified 
by microarray hybridization (Fig. 3). 

Significant differences in expression between these 
two cell lines identified several genes as candidates for 
determining features of the tumorigenic phenotype of 
the melanoma cells. For example, among the genes 
detected with significantly higher expression (> 10-fold) 
in the tumorigenic cells was the human brown locus pro- 
tein (TRP I /melanoma antigen gp75). This is the most 
abundant glycoprotein in mclanocytic cells and a critical 
ph"! nvmbr.me protem*-- 1 ' Additionally, its 
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In a representative two-colour fluorescent scan of all cell-cell signaling durine development 1 * 1 . 




Labeled Poly dT 

Labeled 903 mRNA 
♦ competitors 



Labeled Cot 1 

Labeled 903 mRNA 
♦ competitors 



s. 



ARRAY QUADRANT MAP (4X=116D 



• • • 



• (54) Hybridization Specificity Controls 
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Fig. 1 Properties of cDNA mtcroarrays. a. A fluorescent scan of DNA printed onto a poty- lysine coated slide. The DMA rs stained wrth a DMA-specific fluorescent 
dye, YOYO. The center- to- center spacing of adjacent spots is 450 M- allowing the potential for up to 10.000 spots/2.54 X 7.62 cm microscope slide. 6. Effi- 
cient blocking of hybridization to DNA repeats. Hybridization of fluorescein- labelled poly (dT)* to arrays in the absence of competitor produces strong 
hybridization to immobilized poly (dA)' as well as to some cDNAs. such as the EST T64827 shown. Rhodamine-labelted cDNA (red) from the UACC-9Q3 cell 
line hybridized m the presence of poly (dA)* Wocker shows little if any signal at either site (Total H = total human). Similarly, hybndization with fluorescein- tabeHed 
Cotl DNA in the absence of competitor produces bright signal on immobilized Con DNA. total human DNA and at some cDNA elements (presumed to con- 
tain highly repeated sequences, such as R23416): while Rhoda mine- labelled cDNA (red) from the UACC-903 cell line produces little it any signal at these 
locations when hybridized in the presence of excess unlabeled poly (dA)* . and human Cot1 DNA. The absence of signal at some cDNA locations following 
UACC-903 cDNA hybndizations also indicates that the PC R- amplified, plasmid vector sequences at all cDNA targets do not contnbute significant hybridiza- 
tion signal, c. Schematic of the array organisation. Robotic printing from 96 well microliter trays was earned out with 4 print heads, spaced to tit into 4 adja- 
cent microtiter wells. This maps the contents of each tray into four separate quadrants on the glass slide. A colour-coded map of the general distribution of 
target types in each of the resulting quadrants is shown. 



els were elevated by the addition of a normal chromo- 
some 6(17 genes) .ire known to be activated by IFN-y. a 
cardinal proinflammatory cytokine that, among other 
activities, induces expression of the gene products of the 
MHC class II locus. For example, the mRNA encoding 
monocyte chemotactic protein I ( MCAF/MCP 1 ), a 
cvtokine that induces monocyte chemotaxis and activa- 
tion lr,f \ was more than 10-fold less abundant in the 
tumongenic cell line. In the skin, MFC I is critical in the 
regulation ol cutaneous monocyte trafficking" 1 " 1 ', and 
elevated expression plays a role in suppression of tumour 
growth .md metastasis 14- - 1 . The mechanism by which 
these interteron-y regulated genes are induced in UACC- 
903 cells by transfer of n normal chromosome 6 remains 
lo be determined. It is worth noting, however, that the 
interferon-y receptor gene is localized to the distal long 
arm of human chromosome 6. 

Finally, several genes that showed > 10- fold higher 
expression in the suppressed UACC-903(+6) cells have 
previously been recognized in other models of tumour 
suppression. Most notably, there was elevated expres- 
sion of the mRNA encoding WAFI {p2I ), a key media- 
tor r\ | t m m o u r c , , j nn r pct . , o r ! H y ^ ^ ^ [ r*? 18? T H 0 ^ ! 



ip21 ) mRNA and protein correlates inversely with 
metastatic potential-"". 



These results provide a wide view of the diverse sys- 
tems that are altered in this model system of tumori- 
genicity, and focus attention on specific gene products 
and pathways that may be of particular importance in 
this tumour type. 

Our ability to classify human cancers in a way that 
reflects the underlying molecular pathology or that 
anticipates their potential for progression or response 
to treatment, remains primitive. Using cDNA microar- 
rays to define alterations in gene expression associated 
with a specific cancer may be an efficient way to uncov- 
er clues to the specific molecular derangements that con- 
tribute to its pathogenesis and thus identify potential 
targets for therapeutic intervention. Moreover, recogni- 
tion of pathognomonic alterations in gene expression 
might provide a basis for improved diagnosis and mol- 
ecular classification of cancers and thus allow selection of 
the most appropriate therapeutic strategies. 

Public databases of human expressed gene sequences 
contain partial sequences of at least 40,000 different 
human genes 11 , and efforts to develop a human tran- 
script map have developed rapidly- 1 . Based on the high 

Vield r\i i n t r\ r rrj 3 r j r> n rjKi ( itn^rt 'J^jfJ'J 3J"» 3rr2V of "C ] 000 
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understanding of the gene pathways involved in the 
pathogenesis of cancer and other diseases. 
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Fig. 2 DNA microarray anatysis of changes in gene expression between the tumongenic cell line. UACC-903. and its non-tumorigemc derivative, UACC903(+6), 
derived by introduction of a norma! chromosome 6. a, A ratio image of the results of simultaneous hybridization of Rhodammel 10-labelled cONA (green) 
from UACC-903 and Cy3-tabelied cDNA (orange-red) from UACC-903<+6) to a microarray. To produce this image, the scan images corresponding to each flu- 
orescent probe were combined as the appropriate colour channels tn a single image. Arrows indicate the location wtthm the array of the corresponding 
genes analysed by northern blotting (Fig. 3). b, A magnified image of the area of the array boxed in white in (a), c, Magnified image of three cDNAs mdenti- 
fied by arrows m (a), representing the cDNAs for: left. MCAF/MCP-1 (r/g ratio >10): centre, (i-actm (r/g ratio 1.04); and right. n-l-anUchymotrypstn (r/g ratio 
0.2) [see Fig. 3]. d, simplified representation of ratio hybridization results. Quantitative fluorescence intensity data is extracted from each array target. The aver- 
age target colour ratio determines the hue of each box and the average intensity determines the brightness of each box. in this image, the order of the boxes 
corresponds to their original order m the microliter plate from which they were printed. Duplicate printings of the same plate can be examined side by side, 
as m the first two rows shown here, to assess reproducibility of the hybridization results (see text) Numbered arrows indicate the location within the array cor- 
responding to genes analysed by northern blotting in Fig. 3. 



Methods 

General ion of microarrays, hybridization, scanning. The 
preparation ot coated microscope slides and subsequent robot- 
ic printing ill I )\A was carried mil in a mannci similar to that 
described . Hrielly. prc-clc.med jri.iss slides were treated with 
poly-l -lysine solution \ Stgma i In form an adhesive surface lor 
print tug. Vi'M products, purified hv cthanol purification, were 
resiispeiuled in 3\ SS('.. A custom buili arravim; robot picked 
up and deposited small volumes nanohtersi ol I >NA onto 
the slides Atler printing, the slides were washed m a 0 .2"\> SI )S 
solution. "I he remaining hound DNA was denatured hv sub 
merging the slides in 95 "(! distilled water tot 2 mm followed h\ 
a hnel wash with yv; n ethanol. l)\A was L ! \' ciosslmked lo t he 
slides (Stratagem* Stralalinkcr. fit! nil!, lo prevent non-speeifie 
prohe binding, the slides were blocked In inismc in a solution 
ol 70 inM suveinic anhydride dissoKed in 0.1 M boric add pH 
S.O. containing I - mcthyl-2 - pvrrnhdmnne tAldrich). 

Additional protocols and parts hc{ pertaining to microarrav 
fabrication can Ik obtained from ht!p.//\nigm. Stanford. edit; 



In N. /i\. A separate sean. using the appropriate excitalinr. line, 
was done lor eaeh ot the lv\o fluorophorcs used. Data was col- 
leeied al a maximum resolution of y microns/pixel with 12 bits 
ol depth 

Probe preparation and labelling RN'A was extracted from cells 
usuiv: the 1 na/ol reagent (111 Inc.). following the manufactur- 
et n directions. c 1 )NA piohcs were synthesized from singly ohgo 
d'l -selected (Pharmacia) mKNA pools. Huorescently labelled 
J)NA was prepared from mKNA hv oligo d'l -primed polymer- 
ization using SupeiScitpt II teversc transcriptase ( I ,TI Inc.). 
I he pool ot nucleotides in the labelling reaction was 0.5 niM 
dC.TP. dA'IP and dCI'P and 0.:~mM dTTP. Fluorescent 
nucleotides. Uhodamine I Id dUTP (Peikin Flmer Cetus) or 
0 3 dL'TP ( Amcrsham), were present at 0.1 mM, Probes were 
purified bv gel chromatography ( IboSpm fc/jtioKad) and 
elhanol precipitation. 

Selection of cDNA element* and generation of control tem- 



l" !'"Mti temperature I b hi uli/at n m i u,h earned out al 
b2 °t lor -U h in a water bath Pi mm tit scanning, slide" wvie 
washed in > nm " 2"" SI >s tor ^ mm and 0 > SM ; | or | mm 



•• ^>'t~ promoter 

J I 'ion; eg. m I'nni in use. ! Ik >\ niiu'si/ed KNAs were selected on 
ohgo d I cellulose. The l.ugeM ytoup of c ! >NAs consisted of 674 
cl>\'A clones tioni the I Mil anaved normalized infant brain 

I ■ I t-sv c - .. , >, .....J to i e.a> Mb* 



Fig. 3 Nonnern hyDnOization Suosianltal- 
mg the consistency of the cDNA microar- 
ray results Corresponding tocattons withm 
the cDNA mcroarray illustrated in Fig. 2a 
are provided lor 1 ) Waf- Vp2t. 2) MARCKS: 
3) coltagenase. 4| MCAF/MCP-1; 5) n-1- 
anttchymotrypsm: and 6) 0 -acttn. The sig- 
nal detected by a radio-labelled fS*actm 
probe represents a control for loading van- 
ance, with a red/green ratio observed on 
the cDNA microarray (Fig. 2a.c) tor p-actm 
of 1.04 

to the UniGont- EST clustering sys- 
IC|n :t.:: | nc sccon j largest group of 
clones constMcii of" 1 83 sequenced cDNA 
cloiifs general ei1 by subtraction of i DNA 
from the chromoM»me-fi suppressed 
non-luinorigenic UACC-V03 (+fS) cell 
line vviih cDNA from its parental tumori- 
genic cell line UACC-W (rel. V). 
Approximalely 1 00 .uklitiorul genes 
(total H70 genes a r raved) were obtained 
from KST libraries on the basis of I heir 
a-1-Antichymotrypsin expression pal tern (tissue specific, and so 
on). I\ach array included the lollowuig 
hybridization controls: plasmid vector, 
lambda, OX 174 phage, loial human DNA. 
human Co\\ DNA. and poly (A)\ The 
svnthelic slaiulards used lor normaliza- 
tion of signals in each wavelenglh were 
also arrived, ("onirols were included in 
each iju.ulr.inl of the array to assess the reproducibility of the 
hvbridi/alion signal. Two plates of c DNA clones (derived from 
the UACC-W subtracted library) were also arrayed in dupli- 
cate, l idelitv of the Unigenc array relative to dbKST was lesied 
by sequencing of a random sample ol 11 clones used lor 
microarrav construct ion. All seijueiues were identical with the 



Waf-1 tp21 



MARCKS 



Collagenase 
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corresponding iibl VI entites. AilihiioiialU. e.u n mMH.ni.m,! 
cDNA from ihe l'A(X-V03 subtracted hhr.irv u.iv ha]uuki\! \ 
listing of cDNAs comprising this mumarrax *Wiuh wcu 
derived Irom the I'mgene and 'housekeeping panel «..m iv 
obtained from h it p://\vww.n ih.gov/ D I H/l.CC./AKK .M opn • 
html. 

Northern blot analysis. Total UNA, 10 ug per lane. *\a<. elec- 
Irophnresed in 1 .2'S» agarose-lormaldehvde gels and iranslerred 
onto nylon membrane (Hybond-N*, Amersham) by capillary 
blotting overnight. For DNA probes insert fragments liom ihc 
Soares I Nib cDNA library 1 " were obtained by vector PCR for 
p21, MAUCKS. a-l-antichyni(>lrypsin and P-actin. Piobes lor 
fibroblast collagenase and MCAF/Ml'.P-l were isolated irom a 
UACC-VOJl+M enriched cDNA lilu.ir\" with all probes 
labelled bv random priming, f illers were washed (n a stun- 
gency of 0. Ix SSC at -12 *C lor 20 mm 

Web sites. hltp://cmgm. slanlttrd.edu/pbiown for pmiocols and 
parts list pertaining to micioarrav labncaiion. 
hi t p:// www. nc hgr. n ih. gov/ D I R/ 1. (".("./AURAY/expn. htm I tor a 
listing of cDNAs comprising ibis microarray which were 
derived from the Unigenc and housekeeping panel. 
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A DNA Microarray System for Analyzing 
Complex DNA Samples Using Two-color 
Fluorescent Probe Hybridization 

Dari Shalon, 1 ' 4 Stephen J. Smith, 3 and Patrick O. Brown 1 2 5 

toward Hughes Medical Institute and Departments of biochemistry and ^Molecular and Cellular 
Physiology, Stanford University, Stanford, California 94305 



Detecting and determining the relative abundance of diverse individual sequences in complex DNA samples is 
a recurring experimental challenge in analyzing genomes. We describe a general experimental approach to 
this problem, using microscopic arrays of DNA fragments on glass substrates for differential hybridization 
analysis of fiuorescently labeled DNA samples. To test the system, 864 physically mapped A clones of yeast 
genomic DNA, together representing >75% of the yeast genome, were arranged into L8-cm x 1.8<m arrays, 
each containing a total of 1744 elements. The microarrays were characterized by simultaneous hybridization 
of two different sets of isolated yeast chromosomes labeled with two different fluorophores. A laser 
fluorescent scanner was used to detect the hybridization signals from the two fluorophores. The results 
demonstrate the utility of DNA microarrays in the analysis of complex DNA samples. This system should 
find numerous applications in genome-wide genetic mapping, physical mapping, and gene expression studies. 



Many problems in genome analysis depend on 
determining what specific sequences are repre- 
sented in a complex DNA or RNA sample and at 
what abundance, for example, what genes are 
represented in a specific chromosome band or 
YAC clone, what intervals are amplified or de- 
leted in a particular cancer cell, or what genes are 
expressed in specific cells under specific condi- 
tions. As a general approach to this problem, we 
have developed a system for making microarrays 
of DNA samples on glass substrates, probing 
them by hybridization with complex fluorescent- 
labeled probes, and using a laser-scanning micro- 
scope to detect the fluorescent signals represent- 
ing hybridization. Fluorescent labeling allows for 
simultaneous hybridization and separate detec- 
tion of the hybridization signal from two or more 
probes. This in turn allows very accurate and re- 
liable measurement of the relative abundance of 
specific sequences in two complex samples. 

RESULTS 

Array Hybridization Panern 

Figure 1 shows the two-color fluorescent scan of 

o>rr*fcf>ortdtftcj •utfK« 
f MAIL pbrownfcmqm.itAnf ord tdu, http., cmqm 
>i*nroro eflu pDrown. ?aa 72> l39>. 



with a mixed probe consisting of lissamine- 
labeled DNA from the 6 largest yeast chromo- 
somes together with fluorescein-labeled DNA 
from the 10 smallest yeast chromosomes. A red 
color indicates that yeast sequences present in 
the lissamine-labeled hybridization probe hy- 
bridized to an array element. A yellow-green 
color indicates that yeast sequences present in 
the fluorescein-labeled hybridization probe hy- 
bridized to an array element. An orange color in- 
dicates cross-hybridization of both chromosome 
pools to an array element (e.g., dispersed repeti- 
tive elements, such as Tyl elements). 

Each clone was spotted twice, resulting in du- 
plicate hybridization patterns in adjacent quad- 
rants of the array. Control DNA spots, which 
were randomly amplified in the same manner as 
the x clone array elements, are located in the bot- 
tom corner of each quadrant. V'A" points to a pair 
of spots containing total yeast genomic DNA. 
These spots appear orange because both chromo- 
some pools hybridized to yeast genomic DNA. 
The negative controls are as follows: "B" points 
to a pair of spots of wild-type X DNA, "C" points 
to a pair of human genomic DNA spots, and "D" 

. on iK;. ipwo , . , ^ i w d i r i uui \ ",'it n vDi'iuizauon wa> 
ipeunc iui vtdM sequences. 
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Figure 1 Two-color fluorescent scan of a 1 .8<m x 1 .8-cm yeast array 
of K clones of yeast genomic DNA. The DNA spots are spaced at a 
distance of 380 p.m from center to center. A probe mixture consisting of 
DNA from the 6 largest yeast chromosomes (4, 7, 1 2, 1 3, 1 5, 1 6) labeled 
with lissamine (red dots) and DNA from the 1 0 smallest yeast chromo- 
somes (1, 2, 3, 5, 6, 8, 9, 10, 11, 14) labeled with fluorescein (yellow- 
green dots) was hybridized to the array. A pair of yeast genomic DNA 
spots (A) served as a positive control. The three negative controls are X 
DNA (5), human genomic DNA (C), and 6X1 74 DNA (D). 



Karyorype Depiction of the Array Hybridization 
Pattern 

The inserts contained in the arrayed X clones 
have been mapped physically (Riles et al. 1993). 
The clones are arrayed in a random but known 
order on the array. Therefore, using the identity 
of each clone along with its physical map infor- 
mation, the pattern of hybridization to the yeast 
array can be represented in the form of a karyo- 
type of the yeast genome, as shown in Figure 2. 
The color of any segment of the ideogram repre- 
senting an individual chromosome on the karyo- 



inserts. The chromosome seg- 
ments colored black represent ei- 
ther intervals of the genome that 
are not represented by clones in 
the library (90%> or false-negative 
hybridization signals on the array 
(10%). Most of these talse nega- 
tives are attributable to failures of 
the PCR amplification of the x 
clones, though occasional failures 
of. the arraying process or nonuni- 
form surface preparation could ac- 
count for a small fraction of the 
false-negative signals. The large 
gap on chromosome 12 is the re- 
gion coding for ribosomal DNA 
that was not represented among 
the arrayed clones. Genomic inter- 
vals represented by overlapping 

on the hybridization signals of 
only one of the overlapping 
clones, chosen at random. 

Note that in this representa- 
tion of a yeast karyotype, the larg- 
est six chromosomes are mainly 
colored red. This indicates that 
most of the arrayed clones that 
were mapped previously to these 
six large chromosomes hybridized 
primarily to the lissamine-labeled 
probe prepared from the corre- 
sponding purified chromosomes. 
Conversely, the smallest 10 chro- 
mosomes are mainly colored green 
in this image, matching the origi- 
nal CHEF gel isolation of the chro- 
mosomes used as the hybridization probe. The 
experiment was repeated with the yeast genome 
split into six discrete chromosome pools contain- 
ing 2-4 chromosomes per pool using CHEF gel 
electrophoresis. The chromosomes in each pool 
were extracted from the gel, amplified, and fluo- 
rescently labeled. The six chromosome pools 
were hybridized to six separate yeast arrays. 
Forty-four X clones gave a positive hybridization 
signal on all six arrays indicating that they con- 
tain yeast repetitive sequences (data not shown). 
These 44 clones and 10 clones with very weak 



correspond to the physical lengths of the yeast 
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bands on the otherwise red chromosomes or red 
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Figure 2 Computer-generated ideogram repre- 
senting a karyotype of 5. cerevisiae, based on the 
normalized hybridization signals from the array 

SnOwu in riy. 1 . Note that the 6 idryeM Chromo- 
somes are mainly red and the 10 smallest chromo- 
somes are mainly green. Black stripes represent in- 
tervals not represented by clones in the array or for 
which the corresponding clones gave false-negative 
hybridization signals. 

bands on the otherwise green chromosomes. 
Four randomly chosen examples of these anoma- 
lous clones were analyzed by hybridizing the 
clones to vertical strips cut from a Southern blot 
of CHEF gel-separated yeast chromosomes. In 
each case, the hybridization patterns of the 
anomalous clones corroborated the chromo- 
somal locations assigned by the microarray hy- 
bridization results (data not shown). Two clones 
that were thought to map to the 10 smallest chro- 
mosomes were found to hybridize preferentially 
to the probe representing the 6 largest chromo- 
somes and thus appear as anomalous red bands 
on the karyotype. Both hybridized to one of the 
six largest chromosomes on the Southern blot. 
Similarly, rwo clones that appear as anomalous 
green bands on the karyotype were found to hy- 
bridize to one of the 10 smallest chromosomes on 
the Southern blot. Thus, the anomalous clones 
are probably the result of sample tracking errors 
or, possibly, of errors in the published restriction- 
digest-based physical map on which the karyo- 
type representation was based (Riles et al. 1993). 
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similar to fluorescent in situ hybridization (FISH) 
to metaphase chromosomes, with three impor- 
tant differences. First, the target elements of the 
microarrays can, in principle, be any length or 
composition, from megabase YAC clones or mi- 
crodissected chromosome bands to individual 
cDNA clones, to short oligonucleotides. This ver- 
satility allows the user to choose characteristics, 
such as the mapping resolution and genetic com- 
plexity of each array element, to suit a particular 
application. Second, the hybridization signals are 
localized to discrete elements of known size and 
location, making them easier to identify and 
quantitate than the hybridization signals from 
irregularly shaped metaphase spreads. Third, mi- 
croarrays are more consistent and potentially 
amenable to automated production, hybridiza- 
tion, and data analysis than metaphase spreads. 

Arrays of DNA samples on porous mem- 
branes, for example, dot blots, have long been 
used as a basic tool in molecular biology. Dot- 
blot membranes are usually at least 8 x 12 cm in 
size, require the use of milliliter volumes of hy- 
bridization solution, and are limited, owing to 
autofluorescence and scattering, to radioactive, 
chemiluminescent, and colorimetric hybridiza- 
tion detection methods (Ross et al. 1992). Micro- 
arrays made on glass surfaces, on the other hand, 
can be mass-produced and are comparatively in- 
expensive, convenient, and compatible with 
fluorescent hybridization detection methods. 
Furthermore, a glass surface, when appropriately 
treated, has very low nonspecific binding of la- 
beled hybridization probes, resulting in lower 
backgrounds than are encountered typically with 
porous membranes. For hybridizations with very 
complex probes, the concentration of the labeled 
probe DNA is a limiting factor in the sensitivity 
of the assay. Minimizing the volume of the probe 
solution in a hybridization, by restricting the tar- 
get to a small area and by using a nonporous 
substrate, makes it practical to achieve very high 
probe concentrations. 

One important advantage of fluorescently la- 
beled probes is that, unlike most radioactive and 
chemiluminescent signals, fluorescent signals do 
not disperse and therefore allow for very dense 
array spacing. A unique, and probably the most 
important, advantage of fluorescent probes is 
that the hybridization signals from two or more 
t¥prf " , — * ■ ■ ^ " ' - ,4fT^ 

*a> , rwo-cojor nyonaization detection allows tor 
a direct and quantitative comparison of the 
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abundance of specific sequences between two 
probe mixtures that are hybridized competitively 
to a single array. The absolute intensity of a hy- 
bridization signal at a particular element in an 
array can vary owing to experimental factors 
such as variations in the amount of DNA depos- 
ited on the array, variations in the hybridization 
or wash conditions between experiments, or 
variations in the hybridization characteristics of 
the different DNA sequences on the array. The 
ratio of the two signals at any element in an ar- 
ray, however, is relatively insensitive to these 
confounding factors because they affect both 
probe mixtures equivalently. This ratio therefore 
accurately reflects the relative abundance of the 
cognate sequence in the two probe samples. This 
is the principle underlying the technique of com- 
parative genomic hybridization (CGH), which is 
used to detect changes in the copy number of 
specific chromosomes or chjofnosonial regions 
(Kallioniemi et al. 1992). CGH is based on mea- 
suring the relative fluorescent hybridization in- 
tensities of two genomic-complexity hybridiza- 
tion probes, for example, probes representing ge- 
nomic DNA from normal and affected tissue 
samples, which are labeled with two distinct fluo- 
rophores and hybridized simultaneously to a 
metaphase spread. DNA microarray representa- 
tions of the human genome may provide a more 
convenient and higher resolution alternative to 
metaphase chromosomes for CGH. 

Cross-hybridization between related se- 
quences is an important problem faced by any 
hybridization-based assay, including the DNA 
microarray assay described here. Studies are now 
in progress to quantitate the extent of cross- 
hybridization between related sequences of vary- 
ing homology and length, in DNA microarray 
hybridizations. The stringency of hybridization 
and washing can be controlled by varying the salt 
concentration and temperature as in conven- 
tional membrane-based hybridizations. Cross- 
hybridization caused by repetitive sequences can 
be minimized by prehybridization of the probe or 
array with vast excess of unlabeled copies of the 
repetitive sequences. 

Alternative methods have been described for 
making microarrays of very short DNA se- 
quences, involving photolithography (Pease et 

dit mntrenuv umiteu 10 sow complexity d, 
ray elements consisting 01 ougunuueouues. For 
complex-probe hybridizations, the specificity of 



hybridization is improved by using DNA frag- 
ments substantially longer than oligonucleo- 
tides. Moreover, the in situ synthesis approaches 
to array fabrication depend on prior knowledge 
of the sequence to be recognized by each array 
element. The approach described here makes mi- 
aoarrays by transferring tiny volumes of DNA 
samples from microwell storage plates to a solid 
substrate. Thus, nucleic acids (or other mol- 
ecules) of virtually any length or any origin can 
be arrayed, and knowledge of their sequences is 
not required. 

The arrays used in these experiments do not 
represent the maximal achievable density of ele- 
ments. We have found that the spacing between 
the spots can be decreased by shrinking the con- 
tact area of the printing tip and by increasing the 
hydrophobicity of the glass surface. Microarrays 
with 100-M.m feature size have been tested suc- 
cessfully in pilot experiuieuo (data not shown). 
Assuming the projected availability of the appro- 
priate physically mapped human genomic clones 
(Hudson et al. 1995), arrays at 100-jim spacing 
would allow for 10,000 discrete intervals of the 
human genome to be represented in a 1-cm 2 ar- 
ray. Such an array could be used for mapping at a 
resolution of <0.5 Mb. Experiments are in 
progress to explore the feasibility of such arrays. 

Our initial motivation for developing these 
microarrays arose from the need for abundant 
and inexpensive genomic arrays for genomic 
mismatch scanning (GMS) (Nelson et al. 1993), a 
method of genetic linkage analysis based on 
identification of the regions of "identity by de- 
scent" between affected relative pairs using a 
single complex-probe hybridization to an array 
of genomic clones. Experiments using these ar- 
rays to map quantitative trait loci in yeast by 
GMS are currently in progress 0- deRisi, D. Lash- 
kari, L. Penland, L. McAllister, J. McCusker, R. 
Davis, and P.O. Brown, unpubl.). 

Microarrays of cDNA clones, prepared using 
the system described here, have been used for 
quantitative monitoring of gene expression pat- 
terns in Arabidopsis (Schena et al. 1995), 5. cerevi- 
siae (D. Lashkari, J. deRisi, L. Penland, P.O. 
Brown, and R. Davis, unpubl.), and human tis- 
sues 0- deRisi, M. Bittner, P. Meltzer, L. Penland, 
J. Trent, and and P.O. Brown., unpubl ). We an- 
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tions include comparative genomic hybridiza- 
tion (Kallioniemi et al. 1992), sequencing by hy. 
bridization fDrmanac et al. 1993), physical map- 
ping of cloned or amplified sequences (Billings et 
al. 1991), and economical distribution of re- 
agents for integrated genetic and physical map- 
ping based on a common set of arrayed clones 
(Zehemer and Lehrach 1994). 

METHODS 

Amplification of Target DNA Elements 

The array elements were prepared from physically mapped 
X clones (Riles et al. 1993). The X clones were amplified 
using randomly primed polymerase chain reaction (PCR) 
based on published and unpublished protocols (Bohlander 
et al. 1992; S. Nelson, unpubl.). The phage lysates were 
amplified in a JO-m-1 PCR reaction using 5 jlm final concen- 
tration of primer A (GCTATCTTCAAGATCANT^TWNN), 
200 dNTPs. and 1 unit of Taq polymerase. Round A 
consisted of five cycles at 94M for 1 min, 25 # C for 1.5 mm, 
25-72'C over 7 min, and 72*C for 3 min using Taq poly, 
merase (BMB). For round B, the reaction volume was 
brought up to 100 mJ for a final concentration of 2 yiM of 
primer B (GCTATCTTCAAGATCA), 200 m-m dNTPs, and 4 
units of Taq polym erase. Round B consisted of 30 cycles of 
94*C for 1 min, 56'C for 2 min, and 72*C for 3 min. The 
amplification was performed in 96-well plates using crude 
phage lysates as the templates, resulting in an amplifica- 
tion of both the 35-kb x vector and the 5-kb to 15-kb yeast 
in sen sequences as a distribution of PCR products between 
250 bp and 1500 bp in length. 

The PCR products were purified and transferred into 
TE 00 mM Tns, 1 mM EDTA at pH 8.0) buffer using Sepha- 
dex G50 gel filtration (Pharmada) and evaporated to dry- 
ness at room temperature overnight. Each of the 864 am- 
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plified X clones was rehydrated in 15 nl of 3x SSC 
(20 x SSC * 3 v NaCl. 03 u Na 3 dtrate) in preparation (or 
sporting onto the glass under normal room temperature 
conditions. 



Preparation of DNA Microarrays 

The micToanays were fabncated on poly-L-lysine coated 
microscope slides (Sigma). A custom-built arraying ma- 
chine, consisting of four tweezer-like printing tips 
mounted 9 mm apart on a computer-controlled robotic 
stage (Sha)on 1996), loaded 1 nl of the concentrated PCR 
product directly from corresponding clusters of four wells 
of 96-well storage plates and deposited -5 nl of each 
sample onto each of 40 slides. Surface tension loaded the 
sample into the pnnting tip directly from the miaowell 
plate and held the sample in the tip during the printing 
operation. Pnnting was achieved by lightly tapping the tip 
against the glass surface. The open-capillary design al- 
lowed for rapid nnsing and drying of the dps between 
samples. Figure 3 shows the layout of the arraying ma- 
chine. Figure 4 shows a detailed view of the four printing 
tips and the staggered pnnting partem on the microscope 
slides. Adjacent samples were spotted 380 nm apart on the 
slides. After each set of four samples was pnnted onto 40 
slides, the printing tips were rinsed with a jet of water for 
2 sec and then dried by lowering the tips onto a sponge for 
2 sec. The process was repeated for all 864 samples and 
eight control spots. 

After the spotting operation was complete, the slides 
were rehydrated in a humid chamber at room temperature 
for 2 hr, baked in an 80*C vacuum oven for 2 hr, then 
nnsed in 0.1% sodium dodecyl sulfate (SDS) to remove 
unadsorbed DNA. To reduce nonspecific adsorption of the 
labeled hybridization probe to the poly-i-lysine coated 
glass surface, the slides were treated with succinic anhy- 
dride. One gram of succinic anhydride was dissolved in 
100 ml of l-methyl-2-p\TTolidinone and then 100 ml of 
0.2 m bone acid tpH 8.0) was 
added. The arrays were soaked in 
this solution for 10 min and then 
rinsed in distilled water four 
times for 5 min each. Immedi- 
ately before use, the arrayed DNA 
elements were denatured by plac- 
ing the slide in distilled water at 
90*C for 2 mm. 



Amplification and Labeling 
of Hybridization Probe 

The 16 chromosomes of Saccharo* 
mvcts cerrvtsiae were separated us- 
ing a contour-clamped homoge- 
neous electric field (CHEF) aga- 
rose gel apparatus (Bio-Rad) (Qui 
et a! 1986) The 6 largest chromo- 
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rif DNA rrorn racn slice was re- 
covered using a gei extraction kit 
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Figure 4 A close-up view of the four open- 
capillary printing tips. The tips are 9 mm apart and 
fit into four adjacent wells of a standard microwell 
plate and print arrays in a staggered fashion on mi- 
croscope slides. For more details of the printing tips, 
see web page http://cmgm.stanford.edu/pbrown. 



(Qiagen) and randomly amplified in a manner similar to 
that used in amplifying the target \ clones (Grothues et al. 
1993). The main difference between this amplification 
procedure and the one used for the X array elements is a 
filtration step between rounds A and B to remove primer- 
dimers and the use of a random 9-mer 3' end on primer A. 
Following amplification, 2.5 »ig of each of the amplified 
chromosome pools were separately random-primer labeled 
using Kienow polymerase (Amersham) with a lissam un- 
conjugated nucleotide analog (DuPont NEN) for the pool 
containing the 6 largest chromosomes and with a fluores- 
cem-conjugated nucleotide analog (BMB) for the pool con- 
taining the smallest 10 chromosomes. The two fluores- 
cent-labeled pools were mixed and concentrated using an 
ultrafiltration device (Amicon). 



Hybridization 

Five micrograms of the hybndization probe, consisting of 
both chromosome pools in 7.5 pJ of TE, was denatured in 
a boiling water bath and then snap-cooled on ice. Concen- 
trated hybndization solution (2.5 pJ) was added to a final 
concentration of 5x SSC/0.1% SDS. The entire 10 mJ of 
probe solution was transferred to the arTay surface, covered 
with a coverslip, placed in a custom-built single-slide hu- 
midity chamber, and incubated in a 60*C water bath for 12 
nr. The custom-built waterproof slide chamber has a cavity 
just slightly bigger than a microscope slide and was kept at 
100% humidity internally by the addition of 2 mJ of water 
in a corner of the chamber. The slide was rimai in 5 x 
SSC/0.1% SDS for S min and then in 0.2 x SSC/0.1% SDS 
for 5 mm. All rinses were at room temperature. The arTay 
was then air dried, and a drop of antifade (Molecular 
Probes) was applied to the array under a 24- mm x 30-mm 
cr>vpT5hp in preparation for scanning 
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A custom-built laser scanner was used to detect the two- 



color fluorescence hybridization signals from 1.8- 
cm x 1.8-cm arrays at 20-nm resolution. The glass sub- 
strate slide was mounted on a computer-controlled, two- 
axis translation stage (PM-500. Newport. Irvine. CA) that 
scanned the anay over an upward-facing microscope ob- 
jective (20 x, 0.75NA Fluor, Nikon. Melville. NY) m a bi- 
directional raster pattern. A water-cooled .Argon/ Krypton 
laser tlnnova 70 Spectrum. Coherent. Palo Alto. CA). op- 
erated in multiline mode, allowed for simultaneous speci- 
men illumination at 488.0 nm and 568.2 nm. These rwo 
lines were isolated by a 488/568 dual-band excitation filter 
(Chroma Technology, Brattleboro. VT). An epifiuores- 
cence configuration with a dual-band 488/568 primary 
beam splitter (Chroma) excited both fluorophores simul- 
taneously and directed fluorescence emissions toward the 
two-channel detector. Emissions were split by a secondary 
dichroic mirror with a 565 transition wavelength onto two 
multialkali cathode photomultiplier tubes (PMT; R928. 
Hamamatsu, Bridgewater, NJ), one with an HQ53S/50 
bandpass barrier filter and the other with a D630/60 band- 
pass barrier filter (Chroma). Preamplified PMT signals were 
read into a personal computer using a 12-bit analog-to- 
digital conversion board (KT1-834. Analog Devices. Nor- 
wood, MA), displayed in a graphics window, and stored to 
disk for further rendering and analysis. The back aperture 
of the 20 x objective was deliberately underfilled by the 
illuminating laser beam to produce a large-diameter illu- 
minating spot at the specimen (5-fi.m to 10-nm half- 
width). Stage scanning velocity was 100 mm/sec, and PMT 
signals were digitized at 100 jLsec intervals. Two successive 
readings were summed for each pixel such that pixel spac- 
ing in the final image was 20 »un. Beam power at the 
specimen was -5 mW for each of the two lines. 

The scanned image was despeckled using a graphics 
program (Hijaak Graphics Suite) and then analyzed using 
a custom image gridding program that created a spread- 
sheet of the average red and green hybridization intensi- 
ties for each spot. The red and green hybridization inten- 
sities were corrected for optical cross talk between the fluo- 
rescein and lissamine channels, using experimentally 
determined coefficients. 
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Contributed by Ronald W Davis, December 27, 1996 

ABSTRACT cDNA microarray technology is used to profile 
complex diseases and discover novel disease-related genes. In 
inflammatory disease such as rheumatoid arthritis, expression 
patterns of diverse cell types contribute to the pathology. We 
have monitored gene expression in this disease state with a 
microarray of selected human genes of probable significance in 
inflammation as well as with genes expressed in peripheral 
human blood cells. Messenger RNA from cultured macrophages, 

„w~~,i^~,t* r«U nrtmarv rhnndroevtes. and SVIIOVioCVteS 

provided expression profiles for the selected cytokines, chemo- 
kines, DNA binding proteins, and matrix-degrading metal- 
loproteinases. Comparisons between tissue samples of rheuma- 
toid arthritis and inflammatory bowel disease verified the in- 
volvement of many genes and revealed novel participation of the 
cytokine interleukin 3, chemokine Groa and the metal- 
loproteinase matrix metallo-elastase in both diseases. From the 
peripheral blood library, tissue inhibitor of metalloproteinase 1, 
ferritin light chain, and manganese superoxide dismutase genes 
were identified as expressed differentially in rheumatoid arthri- 
tis compared with inflammatory bowel disease. These results 
successfully demonstrate the use of the cDNA microarray system 
as a general approach for dissecting human diseases. 



The recently described cDNA microarray or DNA-chip tech- 
nology allows expression monitoring of hundreds and thou- 
sands of genes simultaneously and provides a format for 
identifying genes as well as changes in their activity (1, 2). 
Using this technology, two-color fluorescence patterns of 
differential gene expression in the root versus the shoot tissue 
oiArabidopsis were obtained in a specific array of 48 genes ( 1 ). 
In another study using a 1000 gene array from a human 
peripheral blood library, novel genes expressed byT cells were 
identified upon heal shock and protein kinase C activation (3). 

The technology uses cDNA sequences or cDNA inserts of a 
library for PCR amplification that are arrayed on a glass slide with 
high speed robotics at a density of 1000 cDNA sequences per cm 2 . 
These microarrays serve as gene targets for hybridization to 
cDNA probes prepared from RNA samples of cells or tissues. A 
two-color fluorescence labeling technique is used in the prepa- 
ration of the cDN A probes such that a simultaneous hybridization 
but separate detection of signals provides the comparative anal- 
ysis and the relative abundance of specific genes expressed (1,2). 
Microarrays can be constructed from specific cDNA clones of 
interest, a cDNA library, or a select number of open reading 
frames from a genome sequencing database to allow a large-scale 
functional analysis of expressed sequences. 
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Because of the wide spectrum of genes and endogenous 
mediators involved, the microarray technology is well suited 
for analyzing chronic diseases. In rheumatoid arthritis (RA), 
inflammation of the joint is caused by the gene products of 
many different cell types present in the synovium and cartilage 
tissues plus those infiltrating from the circulating blood. The 
autoimmune and inflammatory nature of the disease is a 
cumulative result of genetic susceptibility factors and multiple 
responses, paracrine and autocrine in nature, from macro- 
phages, T cells, plasma cells, neutrophils, synovial iibrobiasts, 
chondrocytes, etc. Growth factors, inflammatory cytokines 
(4), and the chemokines (5) are the important mediators of this 
inflammatory process. The ensuing destruction of the cartilage 
and bone by the invading synovial tissue includes the actions 
of prostaglandins and leukotrienes (6), and the matrix degrad- 
ing metalloproteinases (MMPs). The MMPs are an important 
class of Zn-dependent metallo-endoproteinases that can col- 
lectively degrade the proteoglycan and collagen components of 
the connective tissue matrix (7). 

This paper presents a study in which the involvement of 
select classes of molecules in RA was examined. Also inves- 
tigated were 1000 human genes randomly selected from a 
peripheral human blood cell library. Their differential and 
quantitative expression analysis in cells of the joint tissue, in 
diseased RA tissue and in inflammatory bowel disease (IBD) 
tissues was conducted to demonstrate the utility of the mi- 
croarray method to analyze complex diseases by their pattern 
of gene expression. Such a survey provides insight not only into 
the underlying cause of the pathology, but also provides the 
opportunity to selectively target genes for disease intervention 
by appropriate drug development and gene therapies. 

METHODS 

Microarray Design, Development, and Preparation. Two ap- 
proaches for the fabrication of cDNA microarrays were used in 
this study. In the first approach, known human genes of probable 
significance in RA were identified- Regions of the clones, pref- 
erably 1 kb in length, were selected by their proximity to the 3' end 
of the cDNA and for areas of least identity to related and 
repetitive sequences. Primers were synthesized to amplify the 
target regions by standard PCR protocols (3). Products were 



Abbreviations: RA, rheumatoid arthritis; MMP, matrix- degrading 
metalloproteinase; IBD, inflammatory bowel disease; LPS, lipopoly- 
sacchandc; PMA, phorbol 12-myrtstate 13-acetaie; TNF-a. rumor 
necrosis factor a; IL, interleukin; TGF-0. transforming growth factor 
3; GCSF, granulocyte colony-stimulating factor; MIP, macrophage 
inflammatory protein; MIF. migration inhibitory factor; HME, human 
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verified by gel electrophoresis and purified with Qiaquick 96-we!l 
purification kit (Qiagen, Chaisworth, CA). Wophilized I (Savant), 
and resuspended in 5 ^ of 3 x standard saline citrate (SSC) buffer 
for arraying. In the second approach, the microarray containing 
the 1056 human genes from the peripheral blood lymphocyte 
library was prepared as described (3). 

Tissue Specimens. Rheumatoid synovial tissue was obtained 
from patients with late stage classic RA undergoing remedial 
synovectomy or arthroplasty of the knee. Synov.al tissue was 
separated from any associated connective tissue or fat. One 
gram of each svnovial specimen was subjected to RNA extrac- 
tion within 40 min of surgical excision, or explants were 
cultured in serum-free medium to examine any changes under 
m vitro conditions- For IBD, specimens of macroscopically 
inflamed lower intestinal mucosa were obtained from patients 
with Crohn disease undergoing remedial surgery. The hypcr- 
trophied mucosal tissue was separated from underlying con- 
ncctive tissue and extracted for RNA. 

Cultured Cells. The Mono Mac-6 (MM6) monocytic cells 
(8) were grown in RPMI medium. Human chondrosarcoma 
SW1353 cells, primary human chondrocytes, and synoviocytes 
(9, 10) were cultured in DMEM; all culture media were 
supplemented with 10% fetal bovine serum, 100 ng/ml strep- 
tnmvcin. and 500 units/ml penicillin. Treatment of cells with 
lipopolysaccharide (LPS) endotoxin at 30 ng/ml, phorbol 
12-myristate 13-acetate (PMA) at 50 ng/ml, tumor necrosis 
factor a (TNF-a) at 50 ng/ml, interleukin (IL)-ip at 30 ng/ml, 
or transforming growth factor-0 (TGF-0) at 100 ng/ml is 
described in the figure legends. 
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Fluorescent Probe, Hybridization, and Scanning. Isolation of 
mRNA, probe preparation, and quantitation with Ambtdopsis 
control mRNAs was essentially as described (3) except for the 
following minor modificatioa Following the reverse transcriptase 
step, the appropriate Cy3- and Cy5-labcled samples were pooled; 
mRNA degraded by heatine the sample to 65'C for 10 min with 
the addition of 5 ^ of 05M NaOH plus Oi ml of 10 mM EDTA. 
The pooled cDNA was purified from unincorporated nucleotides 
by gel filtration in Centri-spin columns (Princeton Separations, 
Adelphia, NJ). Samples were rvophilized and dissolved in 6 *il of 
hybridization buffer (5x SSC plus 02% SDS). Hybridizations, 
washes, scanning, quantitation procedures, and pseudocolor rep- 
resentations of fluorescent images have been described (3). Scans 
for the two fluorescent probes were normalized either to the 
fluorescence intensity of Arvbidopsis mRNAs spiked into the 
labeling reactions (see Figs. 2-4) or to the signal intensity of 
0-actin and glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH; see Fig. 5). 

RESULTS 

Ninety-Six-Gene Microarray Design. The actions of cytokines, 
growth factors, chemokines, transcription factors, MMPs, pros- 
taglandins, and leukotrienes are well recognized in inflammatory 
disease, particularly RA (11-14). Fig. 1 displays the selected genes 
lor tnis study ana aiso inuuuca* tuimui wi^rw v» i.wwwkwv^..^ 
genes such as 0-actin and GAPDH and genes from Arabidopsis 
for signal normalization and quantitation (row A, columns 1-12). 

Defining Microarray Assay Conditions. Different lengths and 
concentrations of target DNA were tested by arraying PCR- 
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amplified products ranging from 02 to 1.2 kb at concentrations 
of 1 ^ig/fil or less. No significant difference in the signal levels was 
observed within this range of target size and only with 0.2-kb 
length was a signal reduced upon an 8-fold dilution of the 1 ^g/^ 
sample (data not shown). In this study the average length of the 
targets was 1 kb, with a few exceptions in the range of -300 bp, 
arrayed at a concentration of 1 figM NormaUy one PCR pro- 
vided sufficient material to fabricate up to 1000 microarray targets. 

In considering positional effects in the development of the 
targets for the microarrays, selection was biased toward the 3' 
proximal regions, because the signal was reduced if the target 
fragment was biased toward the 5' end (data not shown). This 
result was anticipated since the hybridizing probe is prepared by 
reverse transcription with oligo(dT)-primed mRNA and is ncher 
in 3' proximal sequences. Cross-hybridizations of probes to 
targets of a gene family were analyzed with the matrix metai- 
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loproteinases as the example because they can show regions of 
sequence identities of greater than 70%, With collagenase-1 
(Col-1 ) and collagenase-2 (Col-2) genes as targets with up to 70<t 
sequence identity, and stromerysin-1 (Strom- 1) and stromerysin-: 
(Strom-2) genes with different degrees of identity, our results 
showed that a short region of overlap, even with 70-90fr se- 
quence identity, produced a low level of cross- hybridization. 
However, shorter regions of identity spread over the length of the 
target resulted in cross-hybridization (data not shown). For 
closely related genes, targets were designed by avoiding long 
stretches of homology. For members of a gene family two or more 
target regions were included to discriminate between specifiary 
of signal versus cross- hybridization. 

Monitoring Differential Expression in Cultured Cell Lines. In 
RA tissue, the monocyte/macrophage population plays a prom- 
inent role in phagocytic and immunomodulatory activities. Typ- 
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icalry these cells, when triggered by an immunogen, produce the 
promflammairoy cytokines TNF and IL-1. We have used the 
monocyte cell line MM6 and monitored changes in gene expres- 
sion upon activation with LPS endotoxin, a component of Gram- 
negative bacterial membranes, and PMA, which augments the 
action of LPS on TNF production (15). RNA was isolated at 
different times after induction and used for cDNA probe prep- 
aration. From this time course it was clear that TNF expression 
was induced within 15 min of treatment, reached maximum levels 
in 1 hr, remained high until 4 hr and subsequently declined (Fig. 
2A\ Many other cytokine genes were also transiently activated, 
such as IL-1 a and -/3, 11^6, and granulocyte colony-stimulating 
factor (GCSF). Prominent chemokmes activated were IL-8, mac- 
rophage inflammatory protein (MIPHft more so than MlP-la, 
and Groa or melanoma growth stimulatory factor. Migration 
inhibitory factor (MIF) expressed in the uninduccd state declined 
in LPS-activated cells. Of the immediate carry genes, the notice- 
able ones were c-fos,fra-L c-jun, NF-KBp50, and IkB, with c-rel 
expression observed even in the uninduccd state (Fig. IB). These 
expression patterns are consistent with reported patterns of 
activation of certain LPS- and PMA-induced genes (12). Dem- 
onstrated here is the unique ability of this system to allow parallel 
visualization of a large number of gene activities over a period of 

SW1353 cells is a line derived from malignant tumors of the 
cartilage and behaves much like the chondrocytes upon stim- 
ulation with TNF and IL-1 in the expression of MMPs (9). In 
addition to confirming our earlier observations with Northern 
blots on Strom-1, Col-1, and Col-3 expression (9), gelatinase 
(Gel) A, putative metalloproteinase (PTJMP)-l membrane- 
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type matrix metalloproteinase, tissue inhibitors of matrix 
metalloproteinases or tissue inhibitor of metalloproteinase 1 
(TIMP-1 ), -2. and -3 were also expressed by these cells together 
with the human matrix mctallo-elastase (HME: Fig. 14). HME 
induction was estimated to be -50-fold and was greater than 
any of the other MMPs examined (Fig. 3B). This result was 
unexpected because HME is reportedly expressed only by 
alveolar macrophage and placental cells (16). Expression of 
the cytokines and chemokmes. IL-6, IL-8, MIF. and MIP-10 
was also noted. A variety of other genes, including certain 
transcription factors, were also up-regulated (Fig. 3). but the 
overall time-dependent expression of genes in the SW1353 
cells was qualitatively distinct from the MM6 cells. 

Quantitation of differential gene expression (Figs. IB and 
3B) was achieved with the simultaneous hybridization of 
Cy3-labeled cDNA from untreated cells and Cy5-labeled 
cDNA from treated samples. The estimated increases in 
expression from these microarrays for a select number of genes 
including IL-10, IL-8, MIP-10, TNF, HME. Col-1, Col-3, 
Strom-1, and Strom-2 were compared with data collected from 
dot blot analysis. Results (not shown) were in close agreement 
and confirmed our earlier observations on the use of the 
microarray method for the quantitation of gene expression (3). 

Expression Profiles in Primary Chondrocytes and Synovio- 
cytes of Human RA Tissue. Given the sensitivity and the 
specificity of this method, expression profiles of primary 
synoviocytes and chondrocytes from diseased tissue were 
examined. Without prior exposure to inducing agents, low level 
expression of c-yu/i, GCSF, IL-3, TNF-0, MIF. and R ANTES 
(regulated upon activation, normal T cell expressed and se- 
creted) was seen as well as expression of MMPs, GelA, 
Strom-1, Col-1, and the three TIMPs. In this case, Col-2 
hybridization was considered to be nonspecific because the 
second Col-2 target taken from the 3' end of the gene gave no 



A. Human synovial fibroblasts B. Human articular chondrocytes 
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signal. Treatment more so with PMA and IL-1, than TNF and 
IL-1, produced a dramatic up-regulation in expression of 
several genes in both of these primary cell types. These genes 
are as follows: the cytokine IL-6, the chemokines IL-8 and 
Gro-la, and the MMPs; Strom-1, Col-1, Col-3, and HME: and 
the adhesion molecule, vascular cell adhesion molecule 1 
(VCAM-1). The surprise again is HME expression in these 
primary cells, for reasons discussed above. From these results, 
the expression profiles of synoviocytes and the chondrocytes 
appear very similar; the differences are more quantitative than 
qualitative. Treatment of the primary chondrocytes with the 
anabolic growth factor TGF-0 had an interesting profile in that 
it produced a remarkable down-regulation of genes expressed 
in both the untreated and induced state (Fig. 4). 

Given the demonstrated effectiveness of this technology, a 
comparative analysis of two different inflammatory disease 
states was conducted with probes made from RA tissue and 
IBD samples. RA samples were from late stage rheumatoid 
synovial tissue, and IBD specimens were obtained from in- 
flamed lower intestinal mucosa of patients with Crohn disease. 
With both the 96-element known gene micToarray and the 
1000-gene microarray of cDNAs selected from a peripheral 
human blood cell library (3), distinct differences in gene 
e*nre.«ion patterns were evident. On the 96-gene array, RA 
tissue samples from different affected individuals gave similar 
profiles (data not shown) as did different samples from the 
same individual (Fig. 5). These patterns were notably similar 
to those observed with primary synoviocytes and chondrocytes 
(Fig. 4). Included in the list of prominently up-regulated genes 
are IL-6, the MMPs Strom-1, Col-1, GelA, HME, and in 
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Fig. 5. Expression profiles of RA tissue (A) and IBD tissue (B) 
iRNA from R A tissue samples obtained from the same individual was 

; ,! 3 T j .H dirretiv after cxcis?on j'R A "1 W» or maintained m "u^urr 



certain samples PUMP, TIMPs, particularly TIMP-1 and 
T1MP-3, and the adhesion molecule VCAM. Discernible levels 
of macrophage chemotactic protein 1 (MCP-1), MIF and 
RANTES were also noted. IBD samples were in comparison, 
rather subdued although IL-1 converting enzyme (ICE), 
TIMP-1. and MIF were notable in all the three different IBD 
samples examined here. In IBD-A. one of three individual 
samples, ICE, VCAM, Groa, and MMP expression was more 
pronounced than in the others. 

We also made use of a peripheral blood cDNA library (3) 
to identify genes expressed by lymphocytes infiltrating the 
inflamed tissues from the circulating blood. With the 1046- 
element array of randomly selected cDNAs from this library, 
probes made from R A and IBD samples showed hybridizations 
to a large number of genes. Of these, many were common 
between the two disease tissues while others were differentially 
expressed (data not shown). A complete survey of these genes 
was beyond the scope of this study, but for this report we 
picked three genes that were up-regulated in the RA tissue 
relative to IBD. These cDNAs were sequenced and identified 
by comparison to the GenBank database. They are TIMP-1, 
apoferritin light chain, and manganese superoxide dismutase 
(MnSOD). Differential expression of MnSOD was only ob- 
served in samples of RA tissue explants maintained in growth 
fnr anvwhere between 2 to 16 nr. These 
results also indicate that the expression profile of genes can be 
altered when explants are transferred to culture conditions. 

DISCUSSION 

The speed, ease, and feasibility of simultaneously monitoring 
differential expression of hundreds of genes with the cDNA 
microarray based system (1-3) is demonstrated here in the 
analysis of a complex disease such as RA. Many different cell 
types in the RA tissue; macrophages, lymphocytes, plasma cells, 
neutrophils, synoviocytes, chondrocytes, etc. are known to con- 
tribute to the development of the disease with the expression of 
gene products known to be proinflammatory. They include the 
cytokines, chemokines, growth factors, MMPs, eicosanoids, and 
others (7, 11-14), and the design of the 96-element known gene 
microarray was based on this knowledge and depended on the 
availability of the genes. The technology was validated by con- 
firming earlier observations on the expression of TNF by the 
monocyte cell line MM6, and of Col-1 and Col-3 expression in the 
chondrosarcoma cells and articular chondrocytes (9, 12). In our 
time-dependent survey the chronological order of gene activities 
in and between gene families was compared and the results have 
provided unprecedented profiles of the cytokines (TNF, IL-1, 
IL-6, GCSF, and MIF), chemokines (MlP-la, MIP-10, IL-8, and 
Gro-1), certain transcription factors, and the matrix metal- 
loprotemases (GelA, Strom-1, Col-1, Col-3, HME) in the mac- 
rophage cell line MM6 and in the SW1353 chondrosarcoma cells. 

Earlier reports of cytokine production in the diseased state had 
established a model in which TNF is a major participant in RA. 
Its expression reportedly preceded that of the other cytokines and 
effector molecules (4). Our results strongly support these results 
as demonstrated in the time course of the MM6 cells where TNF 
induction preceded that of IL-la and IL-^ followed by IL-6 and 
GCSF. These expression profiles demonstrate the utility of the 
microarrays in determining the hierarachy of signaling events. 

In the SW1353 chondrosarcoma cells, all the known MMPs and 
TIMPs were examined simultaneously. HME expression was 
discovered, which previously had been observed in only the 
stromal cells and alveolar macrophages of smoker's lungs and in 
placental tissue. Its presence in cells of the RA tissue is mean- 
'••diii beczuK '■. ! t\ rj.P cnuse sieninc-ari! destruction of 
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the MMPs, but chemokines and cytokines were also produced by 
these cells. The effect of the anabolic growth factor TGF-0 was 
profoundly evident in demonstrating the down regulation of these 
catabolic activities. m 

RA tissue samples undeniably reflected profiles similar to 
the cell types examined. Active genes observed were IL-3, IL-6, 
ICE the MMPs including HME and TIMPs, chemokines IL-S, 
Groa, MIP, MIF, and R ANTES, and the adhesion molecule 
VCAM. Of the growth factors, fibroblast growth factor 0 was 
observed most frequently. In comparison, the expression 
patterns in the other inflammatory state (i.e., IBD) were not 
as marked as in the R A samples, at least as obtained from the 
tissue samples selected for this study. 

As an alternative approach, the 1046 cDNA microarray of 
randomly selected genes from a lymphocyte library was used to 
identify genes expressed in RA tissue (3). Marry genes on this 
array hybridized with probes made from both RA and IBD tissue 
samples. The results are not surprising because inflammatory 
tissue is abundantly supplied with cell types infiltrating from the 
circulating blood, made apparent also by the high levels of 
chemokine expression in RA tissue. Because of the magnitude of 
the effort required to identify all the hybridized genes, we have for 
this report chosen to describe only three differentially expressed 
genes mainly to verify this method of analysis. 

l>i tne large numocr ui gcnc» uujci »uu imw, « 
were already known as active participants in inflammatory dis- 
ease. These are TNF, 1L-1, IL-6, IL-S, GCSF, R ANTES, and 
VCAM. The novel participants not previously reported are 
HME, IL-3, ICE, and Groa. With our discovery of HME 
expression in RA, this gene becomes a target for drug interven- 
tion. ICE is a cysteine protease well known for its IL-10 process- 
ing activity (18), and recognized for its role in apoptotic cell death 
(19). Its expression in RA tissue is intriguing. IL-3 is recognized 
for its growth-promoting activity in hematopoietic cell lineages, is 
a product of activated T cells (20), and its expression in synovio- 
cytes and chondrocytes of R A tissue is a novel observation. 

Like IL-8, Groa, is a C-X-C subgroup chemokine and is a 
potent neutrophil and basophil chemoattractant. It down- 
regulates the expression of types I and III interstitial collagens 
(21, 22) and is seen here produced by the MM6 cells, in primary 
synoviocytes, and in RA tissue. With the presence of R ANTES, 
MCP, and MIP-10, the C-C chemokines (23) migration and 
infiltration of monocytes, particularly T cells, into the tissue is 
also enhanced (5) and aid in the trafficking and recruitment of 
leukocytes into the RA tissue. Their activation, phagocytosis, 
degranulation. and respiratory bursts could be responsible for 
the induction of MnSOD in R A. MnSOD is also induced by 
TNF and IL-1 and serves a protective function against oxida- 
tive damage. The induction of the ferritin light chain encoding 
gene in this tissue may be for reasons similar to those for 
MnSOD. Ferritin is the major intracellular iron storage protein 
and it is responsive to intracellular oxidative stress and reactive 
oxygen intermediates generated during inflammation (24, 25). 
The active expression of TIMP-1 in RA tissue, as detected by 
the 1000-element array, is no surprise because our results have 
repeatedly shown TIMP-1 to be expressed in the constitutive 
and induced states of RA cells and tissues. 

The suitability of the cDNA microarray technology for 
profiling diseases and for identifying disease related genes is 
well documented here. This technology could provide new 



targets for drug development and disease therapies, and in 
doing so allow for improved treatment of chronic diseases that 
are challenging because of their complexity. 
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Analysis of RNA 



A number of methods have been developed to quantitate, measure the size of 
and map the 5' and 3' termini of specific mRNA molecules in preparations of 
cellular RNA. These include: 

• Northern hybridization (RNA blotting), in which the size and amount of 
specific mRNA molecules in preparations of total or poly(A)* RNA are 
determined (Alwine et al. 1977, 1979). The RNA is separated according to 
size by electrophoresis through a denaturing agarose gel and is then 
transferred to activated cellulose (Alwine et al. 1977; Seed 1982b), nitro- 
cellulose (Goldberg 1980; Thomas 1980; Seed 1982a), or glass or nylon 
membranes (Bresser and Gillespie 1983) (see below). The RNA of interest 
is then located by hybridization with radiolabeled DNA or RNA followed by 
autoradiography. 

• Dot and slot hybridization, in which an excess of radiolabeled probe is 

hybridized to iUN a mat nas ueen nmiiuumtcu u« a oun« ^u^u. * v--— 

al. 1979; Thomas 1980; White and Bancroft 1982). Densitometric tracings 
of the resulting autoradiographs can allow comparative estimates of the 
amount of the target sequence in various preparations of RNA. 

• Mapping RNA using nuclease SI or ribonuclease, in which the precise 
positions of the 5' and 3' termini of the mRNA and the locations of splice 
junctions can be rigorously determined (Berk and Sharp 1977; Weaver and 
Weissmann 1979). Labeled or unlabeled RNA or DNA probes derived from 
various segments of the genomic DNA are hybridized to mRNA, often under 
conditions favoring the formation of DNA:RNA hybrids (Casey and David- 
son 1977). The products of the hybridization are then digested with 
nuclease SI or RNAase under conditions favoring digestion of single- 
stranded nucleic acids only. Analysis of the digestion products by gel 
electrophoresis yields important quantitative and qualitative information 
about the mRNA structure. 

• Primer extension, in which a small radiolabeled fragment of DNA is 
hybridized to the mRNA and used as a primer for reverse transcriptase. 
The resulting product should extend to the extreme 5' terminus of the 
mRNA, and thus the size of the product reflects the number of nucleotides 
from the position of the label to the 5' terminus of the mRNA. 

• Solution hybridization, in which the absolute concentration of the sequence 
of interest is calculated from the rate of hybridization of si small amount of 
a specific radioactive probe with a known quantity of purified cellular RNA 
(see, e.g., Roop et al. 1978; Durnam and Palmiter 1983). Alternatively, an 
excess of a radiolabeled probe is incubated with a known amount of RNA. 
The concentration of the RNA of interest can then be estimated from the 
amount of radioactivity that becomes resistant to nuclease Si (see, e.g., 
tt.,. ...]„... .,i hihi! Reach and Palmitpr 19R1- Williams et al 1986) 
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• Filter hybridization, in which purified cellular RNA is end-labeled with 32 P 
and hybridized to a large excess of the homologous DNA that has been 
immobilized on a solid support (Williams et al. 1986). 

Below we describe northern hybridization. Dot and slot hybridization of 
both crude and purified preparations of RNA are described beginning on page 
7.53; nuclease-Sl and RNAase analysis of specific hybrids, beginning on 
pages 7.58 and 7.71, respectively; and analysis of mRNA by primer extension, 
beginning on page 7.79. 
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